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NOVEL ODORANT RECEPTORS IN DROSOPHILA 



INVENTORS: John R. Carlson, Junhyong Kim, Peter J. Clyne, Coral G. Warr 

5 RELATED APPLICATIONS 

This application claims priority to U.S. provisional patent application Serial No. 
60/1 17,132 filed January 25, 1999 which is herein incorporated by reference in its 
entirety. 

10 U.S. GOVERNMENT SUPPORT 

This work was supported by a grant from the National Institutes of Health (DC- 
02174). 

FIELD OF THE INVENTION 

1 5 This invention pertains to novel olfactory receptors and to methods of using such 

receptors. More particularly, this invention pertains to the nucleic acids and amino acids 
of novel olfactory receptors in Drosophila and to methods of using such nucleic acids and 
amino acids. 

20 BACKGROUND OF THE INVENTION 

Animals can detect a vast array of odors with remarkable sensitivity and 
discrimination. Olfactory information is first received by olfactory receptor neurons 
(olfactory receptors), which transmit signals into the central nervous system (CNS) where 
they are processed, ultimately leading to behavioral responses. An enormous amount of 
25 investigation into olfactory function, organization, and development has been carried out 
in insect model systems for many years (Kaissling et al., (1987) Ann. NY Acad. Sci. 510, 
104-1 12; Hildebrand (1995) Proc. Natl. Acad. Sci. USA 92, 67-74). However, a number 
of central questions have been refractory to incisive analysis because the receptor 
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molecules to which odor molecules bind have not been identified, in any insect. 

To investigate the molecular mechanisms of olfactory function and development, 
applicants studied the olfactory system of Drosophila melanogaster, which is highly 
sensitive and capable of odor discrimination (Siddiqi, (1991) Olfaction in Drosophila, in: 
5 Wysocki & Kare (ed.), Chemical Senses, Marcel Dekker; Carlson (1996) Trends Genet. 
12, 175-180). There are two olfactory organs on the adult fly, the third segment of the 
antenna and the maxillary palp (Figure 1 A). In both organs, olfactory receptors are 
housed in sensory hairs called sensilla. The organization of the approximately 1200 
olfactory receptors of the antenna is complex but ordered. On the antenna there are 

10 different morphological categories of sensilla: s. trichodea, s. coeloconica, large s. 

basiconica, and small s. basiconica (Figure IB). The different morphological categories of 
sensilla are distributed in overlapping patterns across the surface of the antenna (Figures 
1C-F) (Venkatesh & Singh, (1984) Int. J. Insect Morphol. Embryol. 13, 51-63; Stacker, 
(1994) Roux's Arch. Dev. Biol. 205, 62-72). 

15 Electrophysiological studies show that each morphological category of sensilla can 

be divided into different functional types (denoted by different colors in Figures 1C-F), 
defined by the characteristic response profiles of their olfactory receptors (Rodrigues et 
al., (1991) Mol. Gen. Genet. 226, 265-276; Clyne et al, (1997) Invert. Neurosci. 3, 
127-135; de Bruyne et al, unpublished results). For s. trichodea, the different functional 

20 types are segregated into zones on the surface of the antenna (Figure 1C); segregation is 
also observed for the different functional types of s. coeloconica (Figure ID). This zonal 
organization is less conspicuous for the large and small s. basiconica, of which different 
functional types are intermingled (Figures 1E-F). Electrophysiological data suggest that 
there are on the order of thirty different classes of olfactory receptors in the antenna, a 

25 rough estimate based upon the odor response profiles of individual olfactory receptors 
(and in a few cases, the assumption that the neurons of particular functional types of 
sensilla have unique response profiles). 

In contrast to the antenna, the organization of the approximately 120 olfactory 
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receptors of the maxillary palp is less complex. There are approximately 60 s. basiconica 
on the maxillary palp, each housing two olfactory receptors (Singh & Nayak, (1985) Int. 
J. Insect Morphol. Embryol. 14, 291-306). The 120 olfactory receptors fall into six 
different classes based upon their odorant response profiles (Clyne et al., (1999) Neuron 
5 22, 339-347; de Bruyne et al, (1999) J. Neurosci. 19, 4520-4532). Neurons of the six 
ORN classes are always found in characteristic pairs in three functional types of s. 
basiconica, with the total number of neurons in each class being equal. Each class is 
distributed broadly over all, or almost all, of the olfactory surface of the maxillary palp. 
Thus electrophysiological and anatomical studies suggest that there are on the order 

1 0 of thirty-five classes of olfactory receptors in the adult fly (approximately thirty on the 
antenna and six on the palp), each class with a distinct odor sensitivity. Classes of 
olfactory receptors found in the antenna are arrayed in zones, while the classes of 
olfactory receptors found in the maxillary palp are distributed in a less ordered fashion, 
olfactory receptors in both the maxillary palp and the antenna extend their axons to the 

1 5 antennal lobe of the brain, where first-order processing of olfactory information occurs. 
The lobe contains approximately forty olfactory glomeruli, spheroidal modules where 
ORN axons converge and where their terminal branches form synapses with the dendrites 
of their target interneurons (Stocker, (1994) Cell Tissue Res. 275, 3-26; Hildebrand & 
Shepherd, (1997) Annu. Rev. Neurosci. 20, 595-631). 

20 One possibility underlying the molecular basis for distinct odor sensitivities for 

different classes of olfactory receptors is that each class of ORN expresses a unique 
odorant receptor, as has been proposed for vertebrate olfactory systems (Ngai et al., 
(1993) Cell 72, 667-680; Ressler et al, (1993) Cell 73, 597-609; Vassar et al., (1993) Cell 
74, 309-318; Buck, (1996) Annu. Rev. Neurosci. 19, 517-544; Hildebrand & Shepherd, 

25 (1 997) Annu. Rev. Neurosci. 20, 595-63 1). Alternatively, each class of ORN might 

express a unique combination of a large set of receptors, as found in chemosensory cells 
of the nematode, C. elegans (Troemel et al., (1995) Cell 83, 207-218). Both models call 
for a family of receptor genes, and several lines of evidence suggest that for insects such a 
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family would belong to the superfamily of seven-transmembrane G protein-coupled 
receptors (GPCRs). First, there is evidence that insects generate responses to odorants via 
GPCR-activated second-messenger systems. For example, a rapid and transient increase 
in inositol 1,4,5-trisphosphate (IP3) has been observed in response to stimulation with 
5 pheromone and other odors using antennal preparations from various insect species (Breer 
et ah, (1990) Nature 345, 65-68; Boekhoff et al, (1993) Insect Biochem. Mol. Biol. 23, 
757-762; Wegener et al, (1993) J. Insect Physiol. 39, 153-163). This increase in IPs can 
be blocked by pertussis toxin, implicating a G protein signaling cascade (Boekhoff et al, 
(1990) Cell. Signal. 2, 49-56). In Drosophila, norpA mutants, which lack the 

10 phospholipase C that is an essential component of phototransduction, also exhibit reduced 
olfactory responses of the maxillary palp (Riesgo-Escovar et al, (1995) J. Comp. Physiol. 
A180, 151-160). A second reason to suspect that odorant receptors in Drosophila are 
GPCRs is that GPCRs have been shown to be odorant receptors in both vertebrates and C. 
elegans; moreover, abundant evidence indicates that olfactory information in these other 

15 organisms is transduced by GPCR-activated second messenger systems (Buck, (1996) 
Annu. Rev. Neurosci. 19, 517-544; Bargmann & Kaplan, (1998) Annu. Rev. Neurosci. 
21, 279-308). It would thus seem unlikely that a family of receptors that have a 
completely novel structure and that use a completely different transduction mechanism 
would have arisen in insects. 

20 There have been extensive efforts to identify odorant and pheromone receptors in a 

variety of insects using a wide range of strategies. These efforts have been driven in part 
by interest in analyzing receptor genes in the context of highly tractable experimental 
systems in which there is a wealth of knowledge about olfactory function and 
organization. For example, Drosophila offers the advantages of a model genetic 

25 organism together with the ability to measure olfactory function conveniently in vivo, 

through either physiological or behavioral means. Interest in insect odorant receptors has 
also arisen because of the critical role of olfaction in the attraction of many insect pests to 
their plant hosts, of insect vectors of disease to their human hosts, and of insects to their 
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mates. Nevertheless, efforts to identify odorant receptors in insects, based upon searches 
for genes bearing sequence similarities to odorant receptor genes from other organisms, or 
on other strategies, have been unsuccessful. 

Applicants have discovered a novel multigene family encoding candidate odorant 
5 receptors that were identified from the Drosophila genomic sequence database. The 
forty-nine genes described here were discovered using novel computer programs that 
identify diagnostic features of the protein structure of the seven-transmembrane GPCR 
superfamily. Members of this new family are highly divergent from previously defined 
genes. Nearly all of the genes are found to be expressed in one or both of the olfactory 
1 0 organs, and for a number of genes expression is restricted to a subset of olfactory 

receptors. Applicant's further demonstrate that expression of different genes is initiated at 
different times during the development of the adult antenna, and that expression of a 
subset of these candidate receptor genes depends on the POU domain transcription factor, 
Acj6 (abnormal chemosensoryjump 6). 

15 

SUMMARY OF THE INVENTION 

This invention provides isolated nucleic acid molecules including the following: 
a) isolated nucleic acid molecules that encode the amino acid sequences of 
Drosophila Odorant Receptor proteins; 
20 b) isolated nucleic acid molecules that encode protein fragments of at least 6 

amino acids of a Drosophila Odorant Receptor proteins; and 

c) isolated nucleic acid molecules which hybridize to nucleic acid molecules 
which include nucleotide sequences encoding Drosophila Odorant Receptor proteins 
under conditions of sufficient stringency to produce a clear signal. 
25 This invention also provides such isolated nucleic acid molecules wherein the 

nucleic acids include at least one exon-intron boundary located in one of the following 
positions: 

a) the nucleotides encoding the amino acids which include the third extracellular 
-5 - 
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domain of a Drosophila Odorant Receptor protein; 

b) the nucleotides encoding the amino acids which include the fourth extracellular 
domain of a Drosophila Odorant Receptor protein; and 

c) the nucleotides encoding the amino acids which include the fourth intracellular 
5 domain of a Drosophila Odorant Receptor protein. 

This invention further provides such isolated nucleic acid molecules which have the 
nucleic acid sequence of one of the following sequences: SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 
61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95 and 97. 
10 This invention also provides such isolated nucleic acid molecules operably linked to 

one or more expression control elements. 

This invention further provides vectors which include any of the aforementioned 
nucleic acid molecules and host cells which include such vectors.. 

This invention also provides host cells transformed so as to contain any of the 
15 aforementioned nucleic acid molecules, wherein such host cells can be either prokaryotic 
host cells or eukaryotic host cells. 

This invention also provides methods for producing proteins or protein fragments 
wherein the methods include transforming host cells with any of the aforementioned 
nucleic acids under conditions in which the protein or protein fragment encoded by said 
20 nucleic acid molecule is expressed. This invention also provides such methods wherein 
the host cells are either prokaryotic host cells or eukaryotic host cells. This invention 
further provides isolated proteins or protein fragments produced by such methods. 

This invention provides isolated proteins or protein fragments which include: 

a) isolated proteins encoded by one of the following amino acid sequences: SEQ 
25 ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 

48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 
96 and 98; 

b) isolated protein fragments which include at least 6 amino acids of any of the 
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following sequences: SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 
82, 84, 86, 88, 90, 92, 94, 96 and 98; 

c) isolated proteins which include conservative amino acid substitutions of any of 
5 the following sequences: SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 

32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96 and 98; and 

d) naturally occurring amino acid sequence variants of any of the following 
sequences: SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 

10 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 94, 96 and 98. 

The present invention further provides such isolated proteins or protein fragments 
which include at least one of the following conserved amino acids: 

a) Leucine in the third extracellular domain of a Drosophila Odorant Receptor 
1 5 protein; 

b) Histidine in the third extracellular domain of a Drosophila Odorant Receptor 
protein; 

c) Cysteine in the sixth transmembrane domain of a Drosophila Odorant Receptor 
protein; 

20 d) Tryptophan in the fourth extracellular domain of a Drosophila Odorant 

Receptor protein; 

e) Glutamine in the seventh transmembrane domain of a Drosophila Odorant 
Receptor protein; 

f) Proline in the seventh transmembrane domain of a Drosophila Odorant 
25 Receptor protein; 

g) Alanine in the fourth intracellular domain of a Drosophila Odorant Receptor 
protein; and 

h) Tyrosine in the fourth intracellular domain of a Drosophila Odorant Receptor 
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protein. 

The present invention also provides isolated antibodies that bind to any of the 
aforementioned polypeptides. 

The present invention also provides such antibodies which are either monoclonal 
5 antibodies or polyclonal antibodies. 

This invention also provides methods of identifying agents which modulate the 
expression of any of the aforementioned proteins or protein fragments by: 

a) exposing cells which express the proteins or protein fragments to the agents; 

and 

10 b) determining whether the agent modulates expression of said proteins or protein 

fragments, thereby identifying agents which modulate the expression of the proteins or 
protein fragments. 

The present invention also provides methods of identifying agents which modulate 
the activity of any of the aforementioned proteins or protein fragments by: 
15 a) exposing cells which express the proteins or protein fragments to the agents; 

and 

b) determining whether the agents modulate the activity of said proteins or protein 
fragments, thereby identifying agents which modulate the activity of the proteins or 
protein fragments. 

20 The present invention also provides such methods where the agent modulates at least 

one activity of the proteins or protein fragments. 

This invention provides methods of identifying agents which modulate the 
transcription of any of the aforementioned nucleic acid molecules by: 

a) exposing cells which transcribe the nucleic acids to the agents; and 
25 b) determining whether the agents modulate transcription of said nucleic acids, 

thereby identifying agents which modulate the transcription of the nucleic acid. 

This invention further provides methods of identifying binding partners for the 
aforementioned proteins or protein fragments by: 



Attorney Docket No. 44574-5061 

a) exposing said proteins or protein fragments to potential binding partners; and 

b) determining if the potential binding partners bind to said proteins or protein 
fragments, thereby identifying binding partners for the proteins or protein fragments. 

The present invention also provides methods of modulating the expression of nucleic 
5 acids encoding the aforementioned proteins or protein fragments by administering an 

effective amount of agents which modulate the expression of the nucleic acids encoding 
the proteins or protein fragments. 

This invention also provides methods of modulating at least one activity of the 
aforementioned proteins or protein fragments by administering an effective amount of the 
10 agents which modulate at least one activity of the proteins or protein fragments. 

This invention provides methods of identifying novel olfactory receptor genes by: 

a) selecting candidate olfactory receptor genes by screening nucleic acid databases 
using an algorithm trained to identify seven transmembrane receptors genes; 

b) screening said selected candidate olfactory receptor genes by identifying 

1 5 nucleic acid sequences with conserved amino acid residues and intron-exon boundaries 

common to olfactory receptors, and having open reading frames of sufficient size so as to 
encode a seven transmembrane receptor; and 

c) identifying the novel olfactory receptor genes and measuring the expression of 
olfactory receptor genes wherein the detection of expression confirms said candidate 

20 olfactory genes as olfactory genes. 

This invention also provides methods of identifying novel olfactory receptor genes 

by: 

a) selecting candidate olfactory receptor genes by screening nucleic acid databases 
for nucleic acid sequences with sufficient homology to at least one known olfactory 

25 receptor gene; 

b) screening said selected candidate olfactory receptor genes by identifying 
nucleic acids with conserved amino acid residues and intron-exon boundaries common to 
olfactory receptors, and having open reading frames of sufficient size so as to encode a 
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seven transmembrane receptor; and 

c) identifying the novel olfactory receptor genes and measuring the expression of 
olfactory receptor genes wherein the detection of expression confirms said candidate 
olfactory genes as olfactory genes. 

The present invention also provides transgenic insects modified to contain any of the 
aforementioned nucleic acid molecules. 

This invention also provides such transgenic insects, wherein the nucleic acid 
molecules contain mutations that alter expression of the encoded proteins. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 An overview of the olfactory system of the adult Drosophila. (A) The two 
olfactory organs of the adult fly, the third antennal segment (arrow) and the maxillary 
palp (arrowhead), scale bar = 100 p. (B) Higher magnification of part of a third 
antennal segment showing the morphological categories of olfactory sensilla: s. 
basiconica [B], s. trichodea [T] and s. coeloconica [C], scale bar = 5 um. (C-F) Diagram 
of the olfactory sensilla on the anterior face of the third antennal segment. The different 
morphological categories of sensilla are indicated by different shapes, and the colors 
indicate different functional types of sensilla within each morphological category. Dorsal 
is at the top and medial is to the left. (C) Distribution of different functional types of s. 
trichodea. (D) Distribution of different functional types of s. coeloconica. (E) The large s. 
basiconica are densely clustered in a small dorso-medial region, where the different 
functional types are intermingled. For simplicity, only two types are shown. (F) The 
small s. basiconica are widely dispersed, and the different functional types are 
intermingled. 

Figure 2 Genomic organization and hydropathy plots of DOR genes. (A) Genomic 
organization of DOR genes (not to scale). The genes shown are those identified from 
16% of the total genomic sequence; most of the available sequence is from Chromosome 
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2. The approximate chromosomal location of each gene is indicated. Genes separated by 
less than one kilobase are jointly underlined. Within each cluster, all genes are oriented 
in the same direction. The transcriptional orientation of the DOR genes with respect to 
the chromosome is unknown for 2F.1, 25A.1, 47E.2, 59D.1, and the cluster at 33B. (B) 
The 2F.1 gene is flanked by two closely linked genes, fs(l)kl0 and crn. The arrowheads 
indicate the 3 ' end of each gene; for 2F.1 the end of the arrow indicates the position of 
the polyA+ addition signal sequence. (C) Hydropathy plots of the genes whose 
expression patterns are shown in Figures 4-6. Hydrophobic peaks predicted by 
Kyte-Doolittle analysis appear above the center line. The approximate positions of the 
seven putative transmembrane domains are indicated above the first hydropathy plot. 

Figure 3 Amino acid sequence alignment of DOR genes. All DNA sequences were 
obtained from the BDGP database, and the determination of predicted amino acid 
sequences is described in the Examples. Residues conserved in >50% of the predicted 
proteins are shaded. The approximate locations of predicted transmembrane domains 1-7 
are indicated. Exon-intron boundaries are shown with vertical lines. 

Figure 4 DOR genes are expressed in subsets of olfactory receptor neurons in the 
maxillary palp. In situ hybridizations to tissue sections of maxillary palps. Panel A 
shows a frontal section; all other sections are sagittal. (A) A 46F.1 probe reveals 
expression in a subset of olfactory receptors which are broadly distributed. The 
background staining at the periphery of the organ represents non-specific labeling of the 
cuticle, observed equally for sense and antisense probes. (B) A 33B.3 probe also 
hybridizes to a subset of cells. Unlabeled olfactory receptors are visible under the 
cuticular surface (top center). (C) At higher magnification it can be seen that the cells 
expressing 46F.1 are neurons. Note the axons projecting from the cells into the nerve (n) 
which runs through the middle of the maxillary palp. The arrowhead indicates an ORN 
which is not expressing 46F.1, adjacent to an ORN which is strongly stained. The light 
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staining of the nerve is background staining, observed equally for sense and antisense 
probes. (D) 33B.3 is not expressed in the acj6 null mutant, acj6 6 . 

Figure 5 DOR genes are expressed in subsets of antennal cells. Shown are in situ 
5 hybridizations to tissue sections of third antennal segments. In panels A, B, D, and F the 
plane of section passes through the fluid-filled interior of the antenna. (A,B) A 47E.1 
probe hybridizes to a subset of cells which are broadly distributed. (C,D) A 25 A. 1 probe 
hybridizes to a smaller subset of cells. The angle of section in panel C differs somewhat 
from the other panels. (E) A 22A.2 probe hybridizes to a subset of cells in the 
1 0 dorso-medial region where the large s. basiconica are located. (F) 22A.2 is expressed in 
the acj6 6 mutant, in contrast to 33B.3 (Figure 4D). (G) Summary of distributions of 
labeled cells for 47E.1 (open circles), 25 A. 1 (black dots), and 22A.2 (gray dots) on the 
anterior face of the antenna, based on analysis of expression in 30-50 antennae for each 
gene. 

15 

Figure 6 Expression of DOR genes during antennal development. In situ 
hybridizations to tissue sections of third antennal segments at different times during pupal 
development. The times indicated refer to hours APF (after puparium formation). Arrows 
indicate labeled cells. (A) Expression of 22A.2 is not observed at 54 hours APF. Note 

20 that background staining is absent in sections taken at 54 hours (or at earlier times), 

presumably due to the immaturity of the cuticle. (B) Expression of 22A.2 is observed at 
60 hours APF. (C) 47E.1 expression is not observed at 72 hours APF. Background 
staining is observed with both sense and antisense probes on the cuticular surface of the 
sacculus (s), a multi-chambered sensory pit and the dot at the bottom of the third antennal 

25 segment is non-specific staining of a section of tracheal tissue. (D) Expression of 47E. 1 
is detected at 93 hours APF. (E) The odor binding protein OS-E is not expressed at 72 
hours APF. The small dots at the bottom of the antenna are non-specific staining of a 
section of tracheal tissue, observed with both sense and antisense probes. (F) Abundant 
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expression of OS-E is seen at 93 hours APF. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

I. Specific Embodiments 

A. Drosophila Olfactory Receptor Proteins 

The present invention provides a family of isolated proteins, allelic variants of the 
proteins, and conservative amino acid substitutions of the proteins. As used herein, 
protein or polypeptide refers to any one of the proteins that has the amino acid sequence 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 94, 96 and 98. The invention also includes naturally occurring allelic 
variants and proteins that have a slightly different amino acid sequence than that 
specifically recited above. Allelic variants, though possessing a slightly different amino 
acid sequence than those recited above, will still have the same or similar biological 
functions associated with any of the amino acid proteins. 

As used herein, the family of proteins related to any one of the amino acid sequences 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 94, 96 and 98 refers to proteins that have been isolated from organisms in 
addition to Drosophila. The methods used to identify and isolate other members of the 
family of proteins related to these amino acid proteins are described below. 

The proteins of the present invention are preferably in isolated form. As used herein, 
a protein is said to be isolated when physical, mechanical or chemical methods are 
employed to remove the protein from cellular constituents that are normally associated 
with the protein. A skilled artisan can readily employ standard purification methods to 
obtain an isolated protein. 

The proteins of the present invention further include conservative amino acid 
substitution variants (i.e., conservative) of the proteins herein described. As used herein, 
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a conservative variant refers to at least one alteration in the amino acid sequence that does 
not adversely affect the biological functions of the protein. A substitution, insertion or 
deletion is said to adversely affect the protein when the altered sequence prevents or 
disrupts a biological function associated with the protein. For example, the overall 

5 charge, structure or hydrophobic-hydrophilic properties of the protein can be altered 

without adversely affecting a biological activity. Accordingly, the amino acid sequence 
can often be altered, for example to render the peptide more hydrophobic or hydrophilic, 
without adversely affecting the biological activities of the protein. 

Ordinarily, the allelic variants, the conservative substitution variants, and the 

1 0 members of the protein family, will have an amino acid sequence having at least 30% 
amino acid sequence identity with the sequences set forth in SEQ ID NO: 2, 4, 6, 8, 10, 
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98 more 
preferably at least 35%, even more preferably at least 40%, and most preferably at least 

15 45%. Identity or homology with respect to such sequences is defined herein as the 

percentage of amino acid residues in the candidate sequence that are identical with the 
known peptides, after aligning the sequences and introducing gaps, if necessary, to 
achieve the maximum percent homology, and not considering any conservative 
substitutions as part of the sequence identity. N-terminal, C-terminal or internal 

20 extensions, deletions, or insertions into the peptide sequence shall not be construed as 
affecting homology. 

In addition to amino acid sequence identity, the proteins of the present invention 
have seven transmembrane domains as defined by hydropathy analysis (Kyte & Doolittle, 
(1982) J. Mol. Biol. 157, 105-132). Furthermore, the proteins of the present invention 
25 have conserved amino acid residues in defined domains of the protein. For example, the 
proteins of the present invention have at least one of the following conserved amino acids 
as depicted in Figure 3, including but not limited to, Leucine in the third extracellular 
domain; Histidine in the third extracellular domain; Cysteine in the sixth transmembrane 
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domain; Tryptophan in the fourth extracellular domain; Glutamine in the seventh 
transmembrane domain; Proline in the seventh transmembrane domain; Alanine in the 
fourth intracellular domain; or Tyrosine in the fourth intracellular domain. In addition, 
the conserved amino acids may be selected from any of the amino acid residues indicated 
as being conserved among DOR proteins as depicted in Figure 3 (shaded). 

Thus, the proteins of the present invention include molecules having the amino 
acid sequence disclosed in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98; fragments thereof having a consecutive 
sequence of at least about 3, 4, 5, 6, 10, 15, 20, 25, 30, 35 or more amino acid residues of 
the proteins, for instance, antigenic fragments such as those found in the extracellular 
domains of the protein (see Figure 3); amino acid sequence variants wherein an amino 
acid residue has been inserted N- or C-terminal to, or within, the disclosed sequence; and 
amino acid sequence variants of the disclosed sequences, or their fragments as defined 
above, that have been substituted by another residue. Contemplated variants further 
include those containing predetermined mutations by, e.g., homologous recombination, 
site-directed or PCR mutagenesis, and the corresponding proteins of other insect species, 
including but not limited to the order Diptera, Lepidoptera, Homopterera and Coleoptera, 
within these orders, preferably the genus Drosophila, Anopheles, Aedes, Ceratitis, 
Muscidae, Culicidae, Anagasta and Popilla and the alleles or other naturally occurring 
variants of the family of proteins; and derivatives wherein the protein has been covalently 
modified by substitution, chemical, enzymatic, or other appropriate means with a moiety 
other than a naturally occurring amino acid (for example a detectable moiety such as an 
enzyme or radioisotope). 

As described below, members of the family of proteins can be used: 1) to identify 
agents which modulate at least one activity of the protein; 2) to identify binding partners 
for the protein, 3) as an antigen to raise polyclonal or monoclonal antibodies, and 4) in 
methods to modify insect behavior. 
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B. Nucleic Acid Molecules 

The present invention further provides nucleic acid molecules which encode any 
of the proteins having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 

5 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96 and 98 and the related proteins herein described, 
preferably in isolated form. As used herein, "nucleic acid" is defined as RNA or DNA 
that encodes a protein or peptide as defined above, is complementary to a nucleic acid 
sequence encoding such peptides, hybridizes to such a nucleic acid and remains stably 

10 bound to it under appropriate stringency conditions, or encodes a polypeptide sharing at 
least 75% sequence identity, preferably at least 80%, and more preferably at least 85%, 
with the peptide sequences in conserved domains. Specifically contemplated are genomic 
DNA, cDNA, mRNA and antisense molecules, as well as nucleic acids based on 
alternative backbones or including alternative bases whether derived from natural sources 

1 5 or synthesized. Such hybridizing or complementary nucleic acids, however, are defined 
further as being novel and non-obvious over any prior art nucleic acid including that 
which encodes, hybridizes under appropriate stringency conditions, or is complementary 
to nucleic acid encoding a protein according to the present invention. 

Homology or identity at the amino acid or nucleotide level is determined by 

20 BLAST (Basic Local Alignment Search Tool) analysis using the algorithm employed by 
the programs blastp, blastn, blastx, tblastn and tblastx (Karlin et al, (1990) Proc. Natl. 
Acad. Sci. USA 87, 2264-2268 and Altschul, (1993) J. Mol. Evol. 36, 290-300, fully 
incorporated by reference) which are tailored for sequence similarity searching. The 
approach used by the BLAST program is to first consider similar segments between a 

25 query sequence and a database sequence, then to evaluate the statistical significance of all 
matches that are identified and finally to summarize only those matches which satisfy a 
preselected threshold of significance. For a discussion of basic issues in similarity 
searching of sequence databases (see Altschul et al, (1994) Nature Genetics 6, 1 19-129 
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which is fully incorporated by reference). The search parameters for histogram, 
descriptions, alignments, expect (i.e., the statistical significance threshold for reporting 
matches against database sequences), cutoff, matrix and filter are at the default settings. 
The default scoring matrix used by blastp, blastx, tblastn, and tblastx is the 
5 BLOSUM62 matrix (Henikoff et ah, (1992) Proc. Natl. Acad. Sci. USA 89, 

10915-10919, fully incorporated by reference). For blastn, the scoring matrix is set by 
the ratios of M (i.e., the reward score for a pair of matching residues) to N (i.e., the 
penalty score for mismatching residues), wherein the default values for M and N are 5 
and -4, respectively. 

1 o "Stringent conditions" are those that (1) employ low ionic strength and high 

temperature for washing, for example, 0.5 M sodium phosphate buffer at pH 7.2, 1 mM 
EDTA at pH 8.0 in 7% SDS at either 65°C or 55°C, or (2) employ during hybridization a 
denaturing agent such as formamide, for example, 50% formamide with 0.1% bovine 
serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 0.05 M sodium phosphate buffer 

15 at pH 6.5 with 0.75 M NaCl, 0.075 M sodium citrate at 42°C. Another example is use of 
50% formamide, 5x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium 
phosphate at pH 6.8, 0.1% sodium pyrophosphate, 5x Denhardt's solution, sonicated 
salmon sperm DNA (50 u.g/ml), 0.1% SDS and 10% dextran sulfate at 55°C, with washes 
at 55°C in 0.2x SSC and 0.1% SDS. A skilled artisan can readily determine and vary the 

20 stringency conditions appropriately to obtain a clear and detectable hybridization signal. 
Preferred molecules are those that hybridize under the above conditions to the 
complements of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 
83, 85, 87, 89, 91, 93, 95 and 97, and which encode a functional protein. 

25 As used herein, a nucleic acid molecule is said to be "isolated" when the nucleic 

acid molecule is substantially separated from contaminant nucleic acid encoding other 
polypeptides from the source of nucleic acid. 

The present invention further provides fragments of any one of the encoding nucleic 
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acids molecules. As used herein, a fragment of an encoding nucleic acid molecule refers to a 
small portion of the entire protein coding sequence. The size of the fragment will be 
determined by the intended use. For example, if the fragment is chosen so as to encode an 
active portion of the protein, the fragment will need to be large enough to encode the 
5 functional region(s) of the protein. For instance, fragments of the invention encode antigenic 
fragments such as the extracellular loops or N-terminal domain of the protein depicted in SEQ 
ID NO: 2 and as set forth in Figure 3. If the fragment is to be used as a nucleic acid probe or 
PCR primer, then the fragment length is chosen so as to obtain a relatively small number of 
false positives during probing and priming. 

10 Fragments of the encoding nucleic acid molecules of the present invention (i.e., 

synthetic oligonucleotides) that are used as probes or specific primers for the polymerase 
chain reaction (PCR), or to synthesize gene sequences encoding proteins of the invention can 
easily be synthesized by chemical techniques, for example, the phosphotriester method of 
Matteucci et al, (1981) J. Am. Chem. Soc. 103, 3185-3191) or using automated synthesis 

1 5 methods. In addition, larger DNA segments can readily be prepared by well known methods, 
such as synthesis of a group of oligonucleotides that define various modular segments of the 
gene, followed by ligation of oligonucleotides to build the complete modified gene. 

The encoding nucleic acid molecules of the present invention may further be 
modified so as to contain a detectable label for diagnostic and probe purposes. A variety 

20 of such labels are known in the art and can readily be employed with the encoding 

molecules herein described. Suitable labels include, but are not limited to, fluorescent- 
labeled, biotin-labeled, radio-labeled nucleotides and the like. A skilled artisan can 
employ any of the art known labels to obtain a labeled encoding nucleic acid molecule. 
Modifications to the primary structure itself by deletion, addition, or alteration of the 

25 amino acids incorporated into the protein sequence during translation can be made without 

destroying the activity of the protein. Such substitutions or other alterations result in proteins 
having an amino acid sequence encoded by a nucleic acid falling within the contemplated 
scope of the present invention. 
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C. Isolation of Other Related Nucleic Acid Molecules 

As described above, the identification and characterization of the nucleic acid 
molecules having SEQ ID NO: 1,3,5, 7, 9, 11, 13, 15, 17, 19,21,23,25,27, 29,31,33,35, 

5 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 
87, 89, 91, 93, 95 and 97 allows a skilled artisan to isolate nucleic acid molecules that encode 
other members of the protein family in addition to the sequences herein described. Further, 
the presently disclosed nucleic acid molecules allow a skilled artisan to isolate nucleic acid 
molecules that encode other members of the family of proteins in addition to the protein 

10 having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 
44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 
94, 96 and 98. 

Essentially, a skilled artisan can readily use any one of the amino acid sequences 
selected from SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 

15 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 94, 96 and 98, to generate antibody probes to screen expression libraries 
prepared from appropriate cells. Typically, polyclonal antiserum from mammals such as 
rabbits immunized with the purified protein (as described below) or monoclonal antibodies 
can be used to probe a cDNA or genomic expression library to obtain the appropriate coding 

20 sequence for other members of the protein family. The cloned cDNA sequence can be 

expressed as a fusion protein, expressed directly using its own control sequences, or expressed 
by constructions using control sequences appropriate to the particular host used for expression 
of the enzyme. 

Alternatively, a portion of the coding sequence herein described can be synthesized 
25 and used as a probe to retrieve DNA encoding a member of the protein family from any 
organism. Oligomers containing approximately 1 8-20 nucleotides (encoding about a six to 
seven amino acid stretch) are prepared and used to screen genomic DNA or cDNA libraries to 
obtain hybridization under stringent conditions or conditions of sufficient stringency to 
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eliminate an undue level of false positives. 

Additionally, pairs of oligonucleotide primers can be prepared for use in a polymerase 
chain reaction (PCR) to selectively clone an encoding nucleic acid molecule. A PCR 
denature/anneal/extend cycle for using such PCR primers is well known in the art and can 

5 readily be adapted for use in isolating other encoding nucleic acid molecules. For example, 
degenerate primers can be used to clone any DOR gene across species. Specifically, based on 
the sequence information derived from the family of DORs, degenerate primers can be 
designed based on conserved sequences among olfactory receptors, which can then be used to 
clone nucleic acid molecules encoding olfactory receptor proteins from other species of 

10 insects. 

Applicants have also identified a method for isolating nucleic acid molecules that 
encode other members of the protein family in addition to the sequences herein described. 
Essentially, a two-step strategy is employed to identify odorant receptor genes from the 
genomic database. First, a computer algorithm was designed to search genomic sequences for 

1 5 open reading frames (ORFs) from candidate odorant receptor genes. Second, RT-PCR is used 
to determine if transcripts from any of these ORFs are expressed in olfactory organs. 

The algorithm is used to identify GPCR genes using statistical characterization of 
amino acid physico-chemical profiles in combination with a non-parametric discriminant 
function. The algorithm is trained on a set of putative sequences from a database. In the first 

20 step, three sets of descriptors are used to summarize the physico-chemical profiles of the 

sequences. These are GES scale of hydropathy (Engelman et al, (1986) Annu. Rev. Biophys. 
Biophys. Chem. 15, 321-353), polarity (Brown, (1991) Molecular Biology Labfax, Academic 
Press), and amino acid usage frequency. For the first two of these measurements, a computed 
sliding window profile is employed (White, (1994) Membrane Protein Structure, Oxford 

25 University Press) using a kernel of a certain number of amino acids as a constant function 
convoluted with a certain number of amino acids as a Gaussian function. These profiles are 
then summarized with three statistics; the periodicity, average derivative and the variance of 
the derivative. 
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Each sequence is then characterized by multiple variables using a non-parametric 
linear discriminant function that is optimized to separate the known family proteins from 
random proteins in the framing set. The same linear discriminant function with the scores 
derived from the training set is used to screen any nucleic acid database for candidate genes. 
5 The candidate sequences are given significance values by an odds ratio of the proteins and 
non-family proteins, computed using the observed empirical distribution of the training set. 
Those sequences with a sufficiently high odds ratio are considered for further analysis. The 
algorithm can also be used to identify any protein family by altering the framing set of 
sequences. 

1 o The method of identification further includes steps for identifying novel olfactory 

receptor genes comprising selecting candidate olfactory receptor genes by screening a nucleic 
acid database using an algorithm trained to identify seven transmembrane receptors genes; 
screening said selected candidate olfactory receptor genes by identifying nucleic acid 
sequences with conserved amino acid residues and intron-exon boundaries common to 

1 5 olfactory receptors, and open reading frames of sufficient size as to encode a seven 

transmembrane receptor. As an additional step, the expression of olfactory receptor genes is 
measured to confirm candidate olfactory gene as an olfactory gene. The exon-intron 
boundaries and conserved amino acid residues may be selected from any of the positions 
depicted in Figure 3. Alternatively, selecting candidate olfactory receptor genes by screening 

20 a nucleic acid database for nucleic acid sequences with sufficient homology to at least one 
known olfactory receptor gene is also encompassed in the invention. In a preferred 
embodiment, the nucleic acid database is a genomic database, an EST database or even an 
olfactory receptor database as previously described (Skoufos et al., (1999) Nucleic Acids 
Research 27, 343-345). 

25 In one example of the invention, the tiaining set could consist of a subset of seven 

transmembrane proteins such as dopaminergic receptors and could be used to search genomic 
sequences for new subtypes of doparninergic receptors. In another example, the tiaining set 
could consist of ion channels and could be used to identify new subtypes of ion channels in a 
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particular family. In yet another example, the training set could consist of known sequences 
coding for a receptors from a particular family and could be used to identify homologs across 
species. Specifically, olfactory receptors of one species could be used as a training set to 
identify olfactory receptors in another species. 

D. rDNA molecules containing a DNA molecule 

The present invention further provides recombinant DNA molecules (rDNAs) that 
contain a coding sequence. As used herein, a rDNA molecule is a DNA molecule that has 
been subjected to molecular manipulation in situ. Methods for generating rDNA molecules 
are well known in the art, for example, see Sambrook et al, (1985) Molecular Cloning - A 
Laboratory Manual, Cold Spring Harbor Laboratory Press. In the preferred rDNA molecules, 
a coding DNA sequence is operably linked to expression control sequences or vector 
sequences. 

The choice of vector and expression control sequences to which one of the protein 
family encoding sequences of the present invention is operably linked depends directly, as is 
well known in the art, on the functional properties desired, e.g., protein expression, and the 
host cell to be transformed. A vector contemplated by the present invention is at least capable 
of directing the replication or insertion into the host chromosome, and preferably also 
expression, of the structural gene included in the rDNA molecule. 

Expression control elements that are used for regulating the expression of an operably 
linked protein encoding sequence are known in the art and include, but are not limited to, 
inducible promoters, constitutive promoters, secretion signals, and other regulatory elements. 
Preferably, the inducible promoter is readily controlled, such as being responsive to a nutrient 
in the host cell's medium. 

In one embodiment, the vector containing a coding nucleic acid molecule will include 
a prokaryotic replicon, i.e., a DNA sequence having the ability to direct autonomous 
replication and maintenance of the recombinant DNA molecule extra-chromosomally in a 
prokaryotic host cell, such as a bacterial host cell, transformed therewith. Such replicons are 
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well known in the art. In addition, vectors that include a prokaryotic replicon may also 
include a gene whose expression confers a detectable marker such as a drug resistance. 
Typical bacterial drug resistance genes are those that confer resistance to ampicillin or 
tetracycline. 

5 Vectors that include a prokaryotic replicon can further include a prokaryotic or 

bacteriophage promoter capable of directing the expression (transcription and translation) of 
the coding gene sequences in a bacterial host cell, such as E. coli. A promoter is an 
expression control element formed by a DNA sequence that permits binding of RNA 
polymerase and transcription to occur. Promoter sequences compatible with bacterial hosts 

10 are typically provided in plasmid vectors containing convenient restriction sites for insertion 
of a DNA segment of the present invention. Typical of such vector plasmids are pUC8, 
pUC9, pBR322 and pBR329 available from BioRad Laboratories, pPL and pKK223 available 
from Pharmacia. 

Expression vectors compatible with eukaryotic cells, preferably those compatible with 
1 5 vertebrate cells such as insect cells, can also be used to form a rDNA molecules that contains a 
coding sequence. Eukaryotic cell expression vectors are well known in the art and are 
available from several commercial sources. Typically, such vectors are provided containing 
convenient restriction sites for insertion of the desired DNA segment. Typical of such vectors 
are pSVL and pKSV-10 (Pharmacia), pBPV-l/pML2d (International Biotechnologies, Inc.), 
20 pTDT 1 (ATCC, #3 1 255), the vector pCDM8 described herein, and the like eukaryotic 
expression vectors. Vectors may be modified to include insect cell specific promoters if 
needed. 

Eukaryotic cell expression vectors used to construct the rDNA molecules of the 
present invention may further include a selectable marker that is effective in an eukaryotic 
25 cell, preferably a drag resistance selection marker. A preferred drug resistance marker is the 
gene whose expression results in neomycin resistance, i.e., the neomycin phosphotransferase 
(ned) gene (Southern etal, (1982) J. Mol. Appl. Genet. 1, 327-341). Alternatively, the 
selectable marker can be present on a separate plasmid, and the two vectors are introduced by 
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co-transfection of the host cell, and selected by culturing in the appropriate drug for the 
selectable marker. 



E. Host Cells Containing an Exogenously Supplied Coding Nucleic Acid 

5 The present invention further provides host cells transformed with a nucleic acid 

molecule that encodes a protein of the present invention. The host cell can be either 
prokaryotic or eukaryotic. Eukaryotic cells useful for expression of a protein of the invention 
are not limited, so long as the cell line is compatible with cell culture methods and compatible 
with the propagation of the expression vector and expression of the gene product. Preferred 

1 0 eukaryotic host cells include, but are not limited to, yeast, insect and mammalian cells, 

preferably insect cells such as those from & Drosophila cell line. Preferred Drosophila host 
cells include Drosophila Schneider line 2, and the like insect tissue culture cell lines. 

Any prokaryotic host can be used to express a rDNA molecule encoding a protein of 
the invention. The preferred prokaryotic host is E. coli. 

1 5 Transformation of appropriate cell hosts with a rDNA molecule of the present 

invention is accomplished by well known methods that typically depend on the type of vector 
used and host system employed. With regard to transformation of prokaryotic host cells, 
electroporation and salt treatment methods are typically employed, see, for example, Cohen et 
al, (1972) Proc. Natl. Acad. Sci. USA 69, 21 10-21 14; and Maniatis et al, (1982) Molecular 

20 Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory Press. With regard to 

transformation of vertebrate cells with vectors containing rDNAs, electroporation, cationic 
lipid or salt treatment methods are typically employed, see, for example, Graham et al, (1973) 
Virology 52, 456-467; and Wigler et al, (1979) Proc. Natl. Acad. Sci. USA 76, 1373-1376. 

Successfully transformed cells, i.e., cells that contain a rDNA molecule of the present 

25 invention, can be identified by well known techniques including the selection for a selectable 
marker. For example, cells resulting from the introduction of an rDNA of the present 
invention can be cloned to produce single colonies. Cells from those colonies can be 
harvested, lysed and their DNA content examined for the presence of the rDNA using a 
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method such as that described by Southern, (1975) J. Mol. Biol. 98, 503-517; or Berent et al, 
(1985) Biotech. Histochem. 3, 208; or the proteins produced from the cell assayed via an 
immunological method. 

F. Production of Recombinant Proteins using a rDNA Molecule 

The present invention further provides methods for producing a protein of the 
invention using nucleic acid molecules herein described. In general terms, the production of a 
recombinant form of a protein typically involves the following steps: First, a nucleic acid 
molecule is obtained that encodes a protein of the invention, such as any of the nucleic acid 
molecule depicted in SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 
85, 87, 89, 91, 93, 95 and 97. The nucleic acid molecule is then preferably placed in operable 
linkage with suitable control sequences, as described above, to form an expression unit 
containing the protein open reading frame. The expression unit is used to transform a suitable 
host and the transformed host is cultured under conditions that allow the production of the 
recombinant protein. Optionally the recombinant protein is isolated from the medium or 
from the cells; recovery and purification of the protein may not be necessary in some instances 
where some impurities may be tolerated. 

Each of the foregoing steps can be done in a variety of ways. For example, the desired 
coding sequences may be obtained from genomic fragments and used directly in appropriate 
hosts. The construction of expression vectors that are operable in a variety of hosts is 
accomplished using appropriate replicons and control sequences, as set forth above. The 
control sequences, expression vectors, and transformation methods are dependent on the type 
of host cell used to express the gene and were discussed in detail earlier. Suitable restriction 
sites can, if not normally available, be added to the ends of the coding sequence so as to 
provide an excisable gene to insert into these vectors. A skilled artisan can readily adapt any 
host-expression system known in the art for use with the nucleic acid molecules of the 
invention to produce recombinant protein. 
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G. Methods to Identify Binding Partners 

Another embodiment of the present invention provides methods for use in isolating 
and identifying binding partners of any of the DOR proteins of the invention. In detail, a 

5 protein of the invention is mixed with a potential binding partner or an extract or fraction of a 
cell under conditions that allow the association of potential binding partners with the protein 
of the invention. After mixing, peptides, polypeptides, proteins or other molecules that have 
become associated with a protein of the invention are separated from the mixture. The 
binding partner that bound to the protein of the invention can then be removed and further 

1 0 analyzed. To identify and isolate a binding partner, the entire protein, for instance a protein 
comprising the entire amino acid sequence of any of the proteins depicted in SEQ ID NO: 2, 
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 
56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98 can be 
used. Alternatively, a fragment of any of the proteins can be used. 

1 5 As used herein, a cellular extract refers to a preparation or fraction which is made from 

a lysed or disrupted cell. The preferred source of cellular extracts will be cells derived from 
Drosophila, for instance, antennae and maxillary palp cellular extract. 

A variety of methods can be used to obtain an extract of a cell. Cells can be disrupted 
using either physical or chemical disruption methods. Examples of physical disruption 

20 methods include, but are not limited to, sonication and mechanical shearing. Examples of 
chemical lysis methods include, but are not limited to, detergent lysis and enzyme lysis. A 
skilled artisan can readily adapt methods for preparing cellular extracts in order to obtain 
extracts for use in the present methods. 

Once an extract of a cell is prepared, the extract is mixed with any of the proteins of 

25 the invention under conditions in which association of the protein with the binding partner can 
occur. A variety of conditions can be used, the most preferred being conditions that closely 
resemble conditions found in the cytoplasm of a Drosophila cell. Features such as osmolality, 
pH, temperature, and the concentration of cellular extract used, can be varied to optimize the 
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association of the protein with the binding partner. 

After mixing under appropriate conditions, the bound complex is separated from the 
mixture. A variety of techniques can be utilized to separate the mixture. For example, 
antibodies specific to a protein of the invention can be used to immunoprecipitate the binding 
partner complex. Alternatively, standard chemical separation techniques such as 
chromatography and density-sediment centrifugation can be used. 

After removal of non-associated cellular constituents found in the extract, the binding 
partner can be dissociated from the complex using conventional methods. For example, 
dissociation can be accomplished by altering the salt concentration or pH of the mixture. 

To aid in separating associated binding partner pairs from the mixed extract, the 
protein of the invention can be immobilized on a solid support. For example, the protein can 
be attached to a nitrocellulose matrix or acrylic beads. Attachment of the protein to a solid 
support aids in separating peptide-binding partner pairs from other constituents found in the 
extract. The identified binding partners can be either a single protein or a complex made up of 
two or more proteins. Alternatively, binding partners may be identified using a Far- Western 
assay according to the procedures of Takayama et al, (1997) Methods Mol. Biol. 69, 171-184 
or identified through the use of epitope tagged proteins or GST fusion proteins. 

Alternatively, the nucleic acid molecules of the invention can be used in a yeast two- 
hybrid system. The yeast two-hybrid system has been used to identify other protein partner 
pairs (Alifragis et al., (1997) Proc. Natl. Acad. Sci. USA 94, 13099-13104; Dong et al, 
(1999) Gene 237, 421-428) and can readily be adapted to employ the nucleic acid molecules 
herein described. 

In another embodiment, binding partners may be identified in insects using single unit 
recordings as previously described (Kaissling, (1995) Single unit and electroantennogram 
recordings in insect olfactory organs, in: Spielman & Brand (ed.) Experimental Cell Biology 
of Taste and Olfaction, CRC Press). Using single unit recordings in vivo, response profiles 
are established for potential ligands, these profiles are then categorized into distinct functional 
classes indicative of distinct receptor-ligand interactions (see, e.g., U.S. Patent No. 5,993,778). 
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Single unit recordings in transgenic insects which contain transgenes resulting in over- or 
under-expression of a gene are also useful for identifying and characterizing ligands which 
bind to multiple olfactory receptors as well as identifying characterizing new olfactory 
receptors. 

5 The nucleic acids of the invention and their corresponding proteins can be used on an 

array or microarray for high-throughput screening for agents which interact with either the 
nucleic acids of the invention or their corresponding proteins. An "array" or "microarray" 
generally refers to a grid system which has each position or probe cell occupied by a defined 
nucleic acid fragments also known as oligonucleotides. The arrays themselves are sometimes 

1 0 referred to as "chips" or "biochips". High-density nucleic acid and protein microarrays often 
have thousands of probe cells in a variety of grid styles. 

A typical molecular detection chip includes a substrate on which an array of 
recognition sites, binding sites or hybridization sites are arranged. Each site has a respective 
molecular receptor which binds or hybridizes with a molecule having a predetennined 

1 5 structure. The solid support substrates which can be used to form surface of the array or chip 
include organic and inorganic substrates, such as glass, polystyrenes, polyiniides, silicon 
dioxide and silicon nitride. For direct attachment of probes to the electrodes, the electrode 
surface must be fabricated with materials capable of forming conjugates with the probes. 

Once the array is fabricated, a sample solution is applied to the molecular detection 

20 chip and molecules in the sample bind or hybridize at one or more sites. The sites at which 
binding occurs are detected, and one or more molecular structures within the sample are 
subsequently deduced. Detection of labeled batches is a traditional detection strategy and 
includes radioisotope, fluorescent and biorin labels, but other options are available, including 
electronic signal transduction. 

25 Polymer arrays of nucleic acid probes can be used to extract information from, for 

example, nucleic acid samples. These samples are exposed to the probes under conditions that 
permit binding. The arrays are then scanned to determine to which probes the sample 
molecules have interacted with the nucleic acids of the polymer array. One can obtain 
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information by careful probe selection and using algorithms to compare patterns of 
interactions. For example, the method is useful in screening for novel olfactory receptors in 
multiple organisms. For example, Drosophila degenerate olfactory receptor oligonucleotide 
arrays can be used to examine a nucleic acid sample from another insect species in order to 
identify novel olfactory receptors in that species. 

In typical applications, a complex solution containing one or more substances to be 
characterized contacts a polymer array comprising nucleic acids. For example, the array is 
comprised of nucleic acid probes. The probes of the array can be either DNA or RNA, which 
may be either single-stranded or double-stranded. In a preferred embodiment of the 
invention, the probes are arranged (either by immobilization, typically by covalent 
attachment, of a pre-synthesized probe or by synthesis of the probe on the substrate) on the 
substrate or chips in lanes sketching across the chip and separated, and these lanes are in 
turned arranged in blocks of preferably five lanes, although blocks of other sizes will have 
useful application. The present invention provides individual probes, sets of probes, and 
arrays of probe sets on chips, in specific patterns which are used to characterize the 
substances in a complex mixture by producing a distinct image which is representative of the 
binding interactions between the probes on the chip and the substances in the complex 
mixture. The pattern of hybridization to the chip allows inferences to be drawn about the 
substances present in the complex mixture. 

The substances in the complex solution will bind to the nucleic acids on the array. 
The substances of the complex mixture which bind to the nucleic acids of the array may 
include, but are not limited to, complementary nucleic acids, non-complementary nucleic 
acids, proteins, antibodies, oligosaccharides, etc. The types of binding may include, but are 
not limited to, specific and non-specific, competitive and non-competitive, allosteric, 
cooperative, non-cooperative, complementary and non-complementary, etc. For example, the 
nucleic acids of the array can bind to complementary nucleic acids in the complex mixture but 
can also bind in a tertiary manner, independent of base pairing, to non-complementary nucleic 
acids. 
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The nucleic acids of the array or the substances of the complex mixture may be tagged 
with a detectable label. The detectable label can be, for example, a luminescent label, a light 
scattering label or a radioactive label. Accordingly, locations at which substances interact can 
be identified by either determining if the signal of the label has been quenched by binding or 
identifying locations where the signal of the label is present in cases where the substances of 
the complex mixture have been labeled. Based on the locations where binding is detected, 
information regarding the complex mixture can be obtained. 

The methods of this invention will find particular use wherever high through-put of 
samples is required. In particular, this invention is useful in ligand screening settings and for 
determining the composition of complex mixtures. 

Polypeptides are an exemplary system for exploring the relationship between structure 
and function in biology. When the twenty naturally occurring amino acids are condensed into 
a polymeric molecule they form a wide variety of three-dimensional configurations, each 
resulting from a particular amino acid sequence and solvent condition. For example, the 
number of possible polypeptide configurations using the twenty naturally occurring amino 
acids for a polymer five amino acids long is over three million. Typical proteins are more 
than one-hundred amino acids in length. 

In typical applications, a complex solution containing one or more substances to be 
characterized contacts a polymer array comprising polypeptides. The polypeptides of the 
invention can be prepared by classical methods known in the art, for example, by using 
standard solid phase techniques. The standard methods include exclusive solid phase 
synthesis, partial solid phase synthesis methods, fragment condensation, classical solution 
synthesis and recombinant DNA technology (see Merrifield, (1963) Am. Chem. Soc. 85, 
2149-2152). On solid phase, the synthesis is typically commenced from the C-terminal end of 
the peptide using an alpha-amino protected resin. A suitable starting material can be prepared, 
for instance, by attaching the required alpha-amirio acid to a chloromethylated resin, a 
hydroxy-methyl resin or a benzhydrylamine resin. 

The alpha-arnino protecting groups are those known to be useful in the art of stepwise 
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synthesis of peptides. Included are acyl type protecting groups, aromatic urethane type 
protecting groups, aliphatic urethane protecting groups and alkyl type protecting groups. The 
side chain protecting group remains intact during coupling and is not split off during the 
deprotection of the ammo-terrriinus protecting group or during coupling. The side chain 
5 protecting group must be removable upon the completion of the synthesis of the final peptide 
and under reaction conditions that will not alter the target peptide. 

After removal of the alpha-amino protecting group, the remaining protected amino 
acids are coupled stepwise in the desired order. An excess of each protected amino acid is 
generally used with an appropriate carboxyl group activator such as 
1 0 dicyclohexylcarbodiimide (DCC) in solution, for example, in methylene chloride, dimethyl 
formamide (DMF) mixtures. 

These procedures can also be used to synthesize peptides in which amino acids other 
than the twenty naturally occurring, genetically encoded amino acids are substituted at one, 
two, or more positions of any of the compounds of the invention. For instance, 
1 5 naphthylalanine can be substituted for tryptophan, facilitating synthesis. Other synthetic 
amino acids that can be substituted into the peptides of the present invention include 
L-hydroxypropyl, L-3, 4-dihydroxyphenylalanyl, d-amino acids such as L-d-hydroxylysyl and 
D-d-methylalanyl, L-a-methylalanyl and P-amino acids non-naturally occurring synthetic 
amino acids can also be incorporated into the peptides of the present invention (see Roberts et 
20 al, (1983) Peptide Synthesis 5, 341-449). 

One can replace the naturally occurring side chains of the twenty genetically encoded 
amino acids (or D amino acids) with other side chains, for instance with groups such as alkyl, 
lower alkyl, cyclic four, five, six, to seven-membered alkyl, amide, amide lower alkyl, amide 
di(lower alkyl), lower alkoxy, hydroxy, carboxy and the lower ester derivatives thereof, and 
25 with four, five, six, to seven-membered heterocyclic. In particular, proline analogs in which 
the ring size of the proline residue is changed from five members to four, six or seven 
members can be employed. Cyclic groups can be saturated or unsaturated, and if unsaturated, 
can be aromatic or non-aromatic. Heterocyclic groups preferably contain one or more 
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nitrogen, oxygen, and/or sulphur heteroatoms. Examples of such groups include the 
furazanyl, furyl, imidazolidinyl, imidazolyl, imidazolinyl, isothiazolyl, isoxazolyl, 
morpholinyl, oxazolyl, piperazinyl, piperidyl, pyranyl, pyrazinyl, pyrazolidinyl, pyrazolinyl, 
pyrazolyl, pyridazinyl, pyridyl, pyrrmidinyl, pyrrolidinyl, pyrrolinyl, pyrrolyl, thiadiazolyl, 
5 thiazolyl, thienyl, thiomorpholinyl and triazolyl. These heterocyclic groups can be substituted 
or unsubstituted. Where a group is substituted, the substituent can be alkyl, alkoxy, halogen, 
oxygen, or substituted or unsubstituted phenyl. 

One can also readily modify the peptides of the instant invention by phosphorylation 
(see Bannwarth et al, (1996) Biorg. Med. Chem. Let. 6, 2141-2146) and other methods for 

1 0 making peptide derivatives of the compounds of the present invention are described in Hruby 
et al, (1990) Biochem. J. 268, 249-262). Thus, the peptide compounds of the invention also 
serve as a basis to prepare peptide mimetics with similar biological activity. The array can 
also comprise peptide mimetics with the same or similar desired biological activity as the 
corresponding peptide compound but with more favorable activity than the peptide with 

1 5 respect to solubility, stability, and susceptibility to hydrolysis and proteolysis (see Morgan et 
al, (1989) Ann. Rep. Med. Chem. 24, 243-252). 

Peptides suitable for use in this embodiment generally include those peptides, for 
example, ligands, that bind to a receptor, such as seven transmembrane proteins. Such 
peptides typically comprise about 150 amino acid residues or less and, more preferably, about 

20 100 amino acid residues or less. 

The peptides of the present invention may exist in a cyclized form with an 
intramolecular disulfide bond between the thiol groups of the cysteines. Alternatively, an 
intermolecular disulfide bond between the thiol groups of the cysteines can be produced to 
yield a dimeric (or higher oligomeric) compound. One or more of the cysteine residues may 

25 also be substituted with a homocysteine. Other embodiments of this invention provide for 
analogs of these disulfide derivatives in which one of the sulfurs has been replaced by a CH2 
group or other isostere for sulfur. These analogs can be made via an intramolecular or 
intermolecular displacement, using methods known in the art. 
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H. Methods to Identify Agents that Modulate Expression of DORs. 

Another embodiment of the present invention provides methods for identifying agents 
that modulate the expression of a nucleic acid encoding any one of the DOR proteins of the 
invention such as any protein having the amino acid sequence depicted in SEQ ID NO: 2, 4, 6, 
5 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 
58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98. Such 
assays may utilize any available means of monitoring for changes in the expression level of 
the nucleic acids of the invention. As used herein, an agent is said to modulate the expression 
of a nucleic acid of the invention, for instance a nucleic acid encoding any one of the proteins 

1 0 having the amino acid sequence depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 1 8, 20, 22, 
24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98, if it is capable of up- or down-regulating 
expression of the nucleic acid in a cell. 

hi one assay format, cell lines that contain reporter gene fusions between the open 

15 reading frame of any one of the nucleotides depicted in SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13, 15, 
17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95 and 97 and any assay fusion partner 
may be prepared. Numerous assay fusion partners are known and readily available including 
the firefly luciferase gene and the gene encoding chloramphenicol acetyltransferase (Alam et 

20 al, (1990) Anal. Biochem. 188, 245-254). Cell lines containing the reporter gene fusions are 
then exposed to the agent to be tested under appropriate conditions and time. Differential 
expression of the reporter gene between samples exposed to the agent and control samples 
identifies agents which modulate the expression of a nucleic acid encoding at least one of the 
proteins having the sequence depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 

25 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 
76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98. 

Additional assay formats may be used to monitor the ability of the agent to modulate 
the expression of a nucleic acid encoding at least one protein of the invention selected from 



-33- 



Attorney Docket No. 44574-5061 

the group of proteins having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 
32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 
82, 84, 86, 88, 90, 92, 94, 96 and 98. For instance, mRNA expression may be monitored 
directly by hybridization to the nucleic acids of the invention. Cell lines are exposed to the 
5 agent to be tested under appropriate conditions and time and total RNA or mRNA is isolated 
by standard procedures such those disclosed in Sambrook et al., (1985) Molecular Cloning - 
A Laboratory Manual, Cold Spring Harbor Laboratory Press. 

Probes to detect differences in RNA expression levels between cells exposed to the 
agent and control cells may be prepared from the nucleic acids of the invention. It is 

1 0 preferable, but not necessary, to design probes which hybridize only with target nucleic acids 
under conditions of high stringency. Only highly complementary nucleic acid hybrids form 
under conditions of high stringency. Accordingly, the stringency of the assay conditions 
determines the amount of complementary nucleotides which should exist between two nucleic 
acid strands in order to form a hybrid. Stringency should be chosen to maximize the 

1 5 difference in stability between the probe:target hybrid and potential probe:non-target hybrids. 

Probes may be designed from the nucleic acids of the invention through methods 
known in the art. For instance, the G+C content of the probe and the probe length can affect 
probe binding to its target sequence. Methods to optimize probe specificity are commonly 
available in Sambrook et al, (1985) Molecular Cloning - A Laboratory Manual, Cold Spring 

20 Harbor Laboratory Press; or Ausubel et al. , (1 995) Current Protocols in Molecular Biology, 
Greene Publishing Company. 

Hybridization conditions are modified using known methods, such as those described 
by Sambrook et al, (1985) and Ausubel et al, (1995) as required for each probe. 
Hybridization of total cellular RNA or RNA enriched for polyA+ RNA can be accomplished 

25 in any available format. For instance, total cellular RNA or RNA enriched for polyA RNA 
can be affixed to a solid support and the solid support exposed to at least one probe 
comprising at least one, or part of one of the sequences of the invention under conditions in 
which the probe will specifically hybridize. Alternatively, nucleic acid fragments comprising 
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at least one, or part of one of the sequences of the invention can be affixed to a solid support, 
such as a porous glass wafer. The glass wafer can then be exposed to total cellular RNA or 
polyA RNA from a sample under conditions in which the affixed sequences will specifically 
hybridize. Such glass wafers and hybridization methods are widely available, for example, 
5 those disclosed by Beattie (WO 95/1 1755). By examining for the ability of a given probe to 
specifically hybridize to an RNA sample from an untreated cell population and from a cell 
population exposed to the agent, agents which up- or down-regulate the expression of a 
nucleic acid encoding at least one protein having the amino acid sequence depicted in SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 
1 0 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98 
are identified. 

Hybridization for qualitative and quantitative analysis of mRNA may also be carried 
out by using a RNase Protection Assay {i.e., RPA, see Ma et al, (1996) Methods 10, 273- 
238). Briefly, an expression vehicle comprising cDNA encoding the gene product and a 

15 phage specific DNA dependent RNA polymerase promoter {e.g., T7, T3 or SP6 RNA 

polymerase) is linearized at the 3' end of the cDNA molecule, downstream from the phage 
promoter, wherein such a linearized molecule is subsequently used as a template for synthesis 
of a labeled antisense transcript of the cDNA by in vitro transcription. The labeled transcript 
is then hybridized to a mixture of isolated RNA {i.e., total or fractionated mRNA) by 

20 incubation at 45°C overnight in a buffer comprising 80% formamide, 40 rriM Pipes, pH 6.4, 
0.4 M NaCl and 1 mM EDTA The resulting hybrids are then digested in a buffer comprising 
40 ug/ml ribonuclease A and 2 ug/ml ribonuclease. After deactivation and extraction of 
extraneous proteins, the samples are loaded onto urea-polyacrylamide gels for analysis. 
In another assay format, agents which effect the expression of the instant gene 

25 products, cells or cell lines would first be identified which express said gene products 
physiologically. Cells and cell lines so identified would be expected to comprise the 
necessary cellular machinery such that the fidelity of modulation of the transcriptional 
apparatus is maintained with regard to exogenous contact of agent with appropriate surface 
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transduction mechanisms and the cytosolic cascades. Further, such cells or cell lines would be 
transduced or transfected with an expression vehicle (e.g., a plasmid or viral vector) construct 
comprising an operable non-translated 5 '-promoter containing end of the structural gene 
encoding the instant gene products fused to one or more antigenic fragments, which are 
5 peculiar to the instant gene products, wherein said fragments are under the transcriptional 
control of said promoter and are expressed as polypeptides whose molecular weight can be 
distinguished from the naturally occurring polypeptides or may further comprise an 
immunologically distinct tag. Such a process is well known in the art (see Maniatis et al. , 
(1982) Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory Press). 

1 o Cells or cell lines transduced or transfected as outlined above would then be contacted 

with agents under appropriate conditions; for example, the agent comprises an acceptable 
excipient and is contacted with cells comprised in an aqueous physiological buffer such as 
phosphate buffered saline (PBS) at physiological pH, Eagles balanced salt solution (BSS) at 
physiological pH, PBS or BSS comprising serum or conditioned media comprising PBS or 

15 BSS and/or serum incubated at 37°C. Said conditions may be modulated as deemed 

necessary by one of skill in the art. Subsequent to contacting the cells with the agent, said 
cells will be disrupted and the polypeptides from disrupted cells are fractionated such that a 
polypeptide fraction is pooled and contacted with an antibody to be further processed by 
immunological assay (e.g., ELISA, immunoprecipitation or Western blot). The pool of 

20 proteins isolated from the "agent contacted" sample will be compared with a control sample 
where only the excipient is contacted with the cells and an increase or decrease in the 
immunologically generated signal from the "agent contacted" sample compared to the control 
will be used to distinguish the effectiveness of the agent. 

25 I. Methods to Identify Agents that Modulate Activity of DORs 

Another embodiment of the present invention provides methods for identifying agents 
that modulate at least one activity of a protein of the invention such as any one of the proteins 
having the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
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28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98. Such methods or assays may utilize any means 
of monitoring or detecting the desired activity. 

In one format, the relative amounts of a protein of the invention between a cell 
5 population that has been exposed to the agent to be tested compared to an un-exposed control 
cell population may be assayed. In this format, probes such as specific antibodies are used to 
monitor the differential expression of the protein in the different cell populations. Cell lines or 
populations are exposed to the agent to be tested under appropriate conditions and time. 
Cellular lysates may be prepared from the exposed cell line or population and a control, 

1 0 unexposed cell line or population. The cellular lysates are then analyzed with the probe. 

Antibody probes are prepared by immunizing suitable mammalian hosts in appropriate 
immunization protocols using the peptides, polypeptides or proteins of the invention if they 
are of sufficient length, or if desired, required to enhance immunogenicity, conjugated to 
suitable carriers. Methods for preparing immunogenic conjugates with carriers such as BSA, 

1 5 KLH, or other carrier proteins are well known in the art. In some circumstances, direct 

conjugation using, for example, carbodiirnide reagents may be effective; in other instances 
linking reagents such as those supplied by Pierce Chemical Co., may be desirable to provide 
accessibility to the hapten. The hapten peptides can be extended at either the amino or 
carboxy terminus with a cysteine residue or interspersed with cysteine residues, for example, 

20 to facilitate linking to a carrier. Adniiriistration of the immunogens is conducted generally by 
injection over a suitable time period and with use of suitable adjuvants, as is generally 
understood in the art. During the immunization schedule, titers of antibodies are taken to 
determine adequacy of antibody formation. 

While the polyclonal antisera produced in this way may be satisfactory for some 

25 applications, for some applications, use of monoclonal preparations is preferred. 

Immortalized cell lines which secrete the desired monoclonal antibodies may be prepared 
using the standard method of Kohler & Milstein, (1975) Nature 256, 495-497 or modifications 
which effect immortalization of lymphocytes or spleen cells, as is generally known. The 
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immortalized cell lines secreting the desired antibodies are screened by immunoassay in 
which the antigen is the peptide hapten, polypeptide or protein. When the appropriate 
immortalized cell culture secreting the desired antibody is identified, the cells can be cultured 
either in vitro or by production in ascites fluid. 
5 The desired monoclonal antibodies are then recovered from the culture supernatant or 

from the ascites supernatant. Fragments of the monoclonal or polyclonal antisera which 
contain the immunologically significant portion can be used as antagonists, as well as the 
intact antibodies. Use of immunologically reactive fragments, such as the Fab, Fab' of F(ab') 2 
fragments is often preferable, as these fragments are generally less immunogenic than the 

10 whole immunoglobulin. 

The antibodies or fragments may also be produced, using current technology, by 
recombinant means. Antibody regions that bind specifically to the desired regions of the 
protein can also be produced in the context of chimeras with multiple species origin, 
particularly humanized antibodies. 

1 5 Agents that are assayed in the above method can be randomly selected or rationally 

selected or designed. As used herein, an agent is said to be randomly selected when the agent 
is chosen randomly without considering the specific sequences involved in the association of 
the a protein of the invention alone or with its associated substrates, binding partners, etc. An 
example of randomly selected agents is the use a chemical library or a peptide combinatorial 

20 library, or a growth broth of an organism. 

As used herein, an agent is said to be rationally selected or designed when the agent is 
chosen on a non-random basis which takes into account the sequence of the target site and its 
conformation in connection with the agent's action. Agents can be rationally selected or 
rationally designed by utilizing the peptide sequences to identify proposed binding motifs, 

25 glycosylation and phosphorylation sites on the protein. 

The agents of the present invention can be, as examples, peptides, small molecules, 
vitamin derivatives, as well as carbohydrates. A skilled artisan can readily recognize that 
there is no limit as to the structural nature of the agents of the present invention. Dominant- 
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negative proteins, DNA encoding these proteins, antibodies to these proteins, peptide 
fragments of these proteins or rnimics of these proteins may be contacted with cells to affect 
function. "Mimic" as used herein refers to the modification of a region or several regions of a 
peptide molecule to provide a structure chemically different from the parent peptide but 

5 topographically and functionally similar to the parent peptide (see Meyers, (1995) Molecular 
Biology & Biotechnology, VCH Publishers). 

The peptide agents of the invention can be prepared using standard solid phase (or 
solution phase) peptide synthesis methods, as is known in the art. In addition, the DNA 
encoding these peptides may be synthesized using commercially available oligonucleotide 

1 0 synthesis instrumentation and produced recombinantly using standard recombinant production 
systems. The production using solid phase peptide synthesis is necessitated if non-gene- 
encoded amino acids are to be included. 

Another class of agents of the present invention are antibodies immunoreactive with 
critical positions of proteins of the invention. Antibody agents are obtained by immunization 

15 of suitable mammalian subjects with peptides, containing as antigenic regions, those portions 
of the protein intended to be targeted by the antibodies. 



J. Transgenic Organisms 

Transgenic insects containing mutant, knock-out or modified genes corresponding to 
20 any one ofthecDNA sequences depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 
73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95 and 97 are also included in the invention. 
Transgenic insects are genetically modified insects into which recombinant, exogenous or 
cloned genetic material has been experimentally transferred. Such genetic material is often 
25 referred to as a "transgene". The nucleic acid sequence of the transgene, in this case a form of 
any one of the sequences depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 
77, 79, 81, 83, 85, 87, 89, 91, 93, 95 and 97, maybe integrated either at a locus of a genome 
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where that particular nucleic acid sequence is not otherwise normally found or at the normal 
locus for the transgene. The transgene may consist of nucleic acid sequences derived from the 
genome of the same species or of a different species than the species of the target insect. 

The term "germ cell line transgenic insect" refers to a transgenic insect in which the 
5 genetic alteration or genetic information was introduced into a germ line cell, thereby 

conferring the ability of the transgenic insect to transfer the genetic information to offspring. 
If such offspring in fact possess some or all of that alteration or genetic information, then they 
too are transgenic insects. 

The alteration or genetic information may be foreign to the species of insect to which 
1 0 the recipient belongs, foreign only to the particular individual recipient, or may be genetic 

information already possessed by the recipient. In the last case, the altered or introduced gene 
maybe expressed (i.e., over-expression and knock-out) differently than the native gene. 

Transgenic insects can be produced by a variety of different methods including P 
element-mediated transformation by microinjection (see, e.g., Rubin & Spradling, (1982) 
15 Science 218, 348-353; Orr & Sohal, (1993) Arch. Biochem. Biophys. 301, 34-40), 

transformation by microinjection followed by transgene mobilization (Mockett et ah, (1999) 
Arch. Biochem. Biophys. 371, 260-269), electroporation (Huynh & Zieler, (1999) J. Mol. 
Biol. 288, 13-20) and through the use of baculovirus (Yamao et ah, (1999) Genes Dev. 13, 
511-516. Furthermore, the use of adenoviral vectors to direct expression of a foreign gene to 
20 olfactory neuronal cells can also be used to generate transgenic insects (see, e.g. , Holtmaat et 
ah, (1996) Brain. Res. Mol. Brain Res. 41, 148-156). 

A number of recombinant or transgenic insects have been produced, including those 
which over-express superoxide dismutase (Mockett et ah, (1999) Arch. Biochem. Biophys. 
371, 260-269); express Syrian hamster prion protein (Raeber et ah, (1995) Mech. Dev. 51, 
25 31 7-327); express cell-cycle inhibitory peptide aptamers (Kolonin & Finley (1998) Proc. Natl. 
Acad. Sci. USA 95, 14266-14271); and those which lack expression of the putative ribosomal 
protein S3A gene (Reynaud et ah, (1997) Mol. Gen. Genet. 256, 462-467). 

While insects remain the preferred choice for most transgenic experimentation, in 
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some instances it is preferable or even necessary to use alternative animal species. 
Transgenic procedures have been successfully utilized in a variety of animals, including mice, 
rats, sheep, goats, pigs, dogs, cats, monkeys, chimpanzees, hamsters, rabbits, cows and guinea 
pigs (see, e.g., Kim et al, (1997) Mol. Reprod. Dev. 46, 515-526; Houdebine, (1995) 
5 Reprod. Nutr. Dev. 35, 609-617; Petters, (1994) Reprod. Fertil. Dev. 6, 643-645; Schnieke et 
al, (1997) Science 278, 2130-2133; and Amoah, (1997) J. Anim. Sci. 75, 578-585). 

The method of introduction of nucleic acid fragments into insect cells can be by any 
method which favors co-transformation of multiple nucleic acid molecules. For instance, 
Drosophila embryonic Schneider line 2 (S2) cells can be stably transfected as previously 
1 0 described (Schneider, (1 972) J. Embryol. Exp. Morphol. 27, 353-365). Detailed procedures 
for producing transgenic insects are readily available to one skilled in the art (see Rubin & 
Spradling, (1982) Science 218, 348-353; Orr & Sohal, (1993) Arch. Biochem. Biophys. 301, 
34-40, herein incorporated by reference in their entirety). 

1 5 K. Uses for Agents that Modulate at Least One Activity of DORs 
1. Introduction. 

Organisms, including insects, are continually exposed to a great number of volatiles 
released by other organisms as well as by other aspects of their environment. The olfactory 
receptor genes of the present invention play an important role in the detection and processing 

20 of these chemical stimuli, some of which have been implicated in initiating and modulating 
host-seeking and other behaviors, such as mating behaviors (see, for example, Roth, (1951) 
Ann. Entomol. Soc. Am. 44, 59-74; Jones et al, (1976) Ent. Exp. Appn. 19, 19-22; Gillies, 
(1980) Bull. Ent. Res. 70, 525-532; Kline et al., (1991) J. Med. Entomol. 28, 254-258). For a 
recent, thorough review of the many practical applications of the present invention (see Karg 

25 & Suckling, (1999) Applied aspects of insect olfaction, in: Hansson (ed.), Insect Olfaction, 
Springer, which is incorporated by reference in its entirety). 

Most importantly, the DOR genes of the present invention may be used to track down 
odor receptor genes in insects that damage crops or transmit diseases. The present invention 
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provides the tools and methodologies for finding specific compounds that interfere with the 
insects' ability to detect odors. 

Of course, the present invention has important implications for improved methods of 
using pheromones and other semiochemicals for pest control hi addition, recent 
5 advancements in many other fields have greatly increased the variety of additional 

technologies for which the present invention also has significant applications. Examples of 
such' advancements include, but are not limited to the following: i) the development and 
application of new techniques of chemical identification and synthesis; ii) new chemical 
release techniques; iii) more sophisticated application technologies; and iv) more detailed 
1 0 information about the behavior of specific organisms. 

While not wishing to be bound by the specific embodiments discussed herein, the 
following sections provide an overview of the wide variety of applications for which the 
present invention may be employed. 

2. Definitions. 

1 5 As used herein, the term "allomones" refers to any chemical substance produced or 

acquired by an organism that, when it contacts an individual of another species, evokes in the 
receiver a behavioral or developmental reaction adaptively favorable to the transmitter. 

As used herein, the term "host" refers to any organism on which another organism 
depends for some life function. Examples of hosts include, but are not limited to, humans 

20 which may serve as a host for the feeding of certain species of mosquito and the leaves of 

soybeans (Glycine max(L.)) which may act as hosts for the oviposit of the green cloverworm 
(Plathypena scabra (F.)). 

As used herein, the term "kairomones" refers to any of a heterogeneous group of 
chemical messengers that are emitted by organisms of one species but benefit members of 

25 another species. Examples include, but are not limited to, attractants, phagostimulants, and 
other substances that mediate the positive responses of, for example, predators to their prey, 
herbivores to their food plants, and parasites to their hosts. Kairomones suitable for the 
purposes of the invention and methods of obtaining them are described, for example, Science 
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(1966) 154, 1392-93; Hedin, (1985) Bioregulators for Pest Control, American Chemical 
Society, Washington, 353-366. 

As used herein, the term "pheromone" refers to a substance, or characteristic mixture 
of substances, that is secreted and released by an organism and detected by a second organism 
5 of the same or a closely related species, in which it causes a specific reaction, such as a 

definite behavioral reaction or a developmental process. Examples include, but are not limited 
to, the mating pheromones of fungi and insects. More than a thousand moth sex pheromones 
(Toth et ah, (1992) J. Chem. Ecol. 18, 13-25 ; Arn et al, (1998) Appl. Entomol. Zoo. 33, 
507-51 1) and hundreds of other pheromones have now been identified, including aggregation 

1 0 pheromones from beetles and other groups of insects. Various compositions, including resins 
and composite polymer dispensers, have been developed for the controlled release of 
pheromones have been developed (see, e.g., U.S. Patent No. 5,750,129 & 5,504,142). 

As used herein, the term "semiochemical" refers to any chemical substance that 
delivers a message or signal from one organism to another. Examples of such chemicals 

15 include, but are not limited to, pheromones, kairomones, oviposition deterrents, or stimulants, 
and a wide range of other classes of chemicals (see, for example, Nordlund, (1981) 
Semiochemicals: A review of the terminology, in: Nordlund et ah, (ed.) Semiochemicals: 
Their Role in Pest Control, John Wiley; Howse et al., (1998) Insect Pheromones and Then- 
Use in Pest Management, Chapman & Hall, London). 

20 As used herein, the term "synomones" refers to any chemical substance which benefits 

both the emitter and receiver. Examples include, but are not limited to, compounds involved 
in floral attraction of pollinators and species-isolating mechanisms, such as sex pheromones of 
related species, where an inhibitor often functions to prevent mating among sympatric species. 
As used herein, the term "volatile" refers to a chemical which evaporates readily at 

25 those temperatures and pressures which are considered the relevant temperatures and pressures 
for the reference organism of interest. 

3. As Tools for Further Scientific Research. 

Identification of Olfactory Receptor Genes in Other Organisms. The algorithms of the 
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present invention may be used directly to search for olfactory receptor genes in other 

organisms, as explained elsewhere herein. 

Alternatively, nucleic acid probes or primers may be designed based on the DOR 

genes of the present invention. Such probes or primers may be used to identify and isolate 
5 olfactory receptor genes in other organisms. Methods of creating and using the necessary 

nucleic acid probes and primers are discussed elsewhere herein. 

The highest probability of success in locating olfactory genes in other organisms using 

the DOR genes of the present invention will most likely occur by using a boot-strapping or 

leap-frogging method. Such methods involve first probing organisms most related to fruit 
1 0 flies and successively progressing to more unrelated organisms, using the most newly 

identified olfactory receptor genes to identify similar genes in the next, more unrelated, insect 

of interest. Thus, the first organisms to probe with the DOR genes of the present invention 

most preferably may be other flies from the order Diptera {i.e., the two-winged or true flies). 

Examples of suitable flies include, but are not limited to, the tsetse fly, horse fly, house fly, 
15 bluebottle fly, hover fly and mosquito. Dipterans which transmit diseases causing serious 

health problems are of particular interest (e.g., horse fly, tsetse fly, mosquito). 

After the identification of olfactory receptor genes in various Diptera insects, the next 

organisms to probe most preferably may be from orders within the same subclass as Diptera. 

Finally, the next insects to use would be those from orders not within the same subclass as 
20 Diptera. 

The insects which cause substantial health risks, crop damage, or other significant 
damage (e.g., to housing structure or cotton clothing) may be the most desirable targets for 
such studies. Examples of such insects include, but are not limited to, green cloverworm, 
Mexican bean beetle, potato leafhopper, corn earworm, green stink bug, northern corn 
25 rootworm, western corn rootworm, cutworms, wireworms, thrips, fleas, aphids (e.g., pea 

aphid, spotted alfalfa aphid), European corn borer, fall armyworm, southwestern corn borer, 
grasshoppers, Japanese beetle, termites, leafhoppers (e.g., potato leafhopper, three-cornered 
alfalfa hopper), stink bugs, crickets, Hessian fly, greenbugs and weevils (e.g., alfalfa weevil, 
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bollweevil). 

Olfactory receptor genes identified by this process may then be used to screen non- 
Insecta organisms for olfactory receptor genes. Organisms of interest may include, but be 
limited to, mites, ticks, spiders, nematodes, centipedes, mice, rats, salmon, pigeons, dogs, 
5 horses and humans. 

Genetic Manipulations . The tools and methodologies of the present invention may be 
used by neurobiologists to probe more complex workings of an organism's response system, 
including those of a mammal's brain. 

Knock-outs. By systematically knocking out the olfactory receptor genes of the 
1 0 present invention and observing the effects on odor sensitivity and behavior, researchers will 
be able to piece together a wiring diagram of the olfactory system of the fruit fly. 

The term "knock-out" generally refers to mutant organisms which contain a null allele 
of a specific gene. Methods of making knock-out or disruption transgenic animals, especially 
mice, are generally known by those skilled in the art and are discussed herein and elsewhere 
1 5 (see, for example, the section herein entitled Transgenic Organisms and the following: 

Manipulating the Mouse Embryo, (1986) Cold Spring Harbor Laboratory Press; Capecchi, 
(1989) Science 244, 1288-1292; Li etal, (1995) Cell 80, 401-411; U.S. Patent No. 5,981,830 
& 5,789,654, each of which is incorporated herein by reference. 

Parallel studies may be conducted in other organisms by using the olfactory receptor 
20 genes and the methods of the present invention to identify the olfactory receptor genes of 
other organisms and then creating knock-outs for the olfactory receptor genes of those 
organisms. 

Disabling Genes. Using the olfactory receptor genes of the present invention, it is now 
possible to selectively disable specific DOR genes and look for changes in odor response and 
25 behavior. Parallel studies may be conducted in other organisms by using the olfactory 

receptor genes and the methods of the present invention to identify the olfactory receptor 
genes of other organisms and then disabling olfactory receptor genes of those organisms. 
Methods of disabling genes are generally known by those skilled in the art. An 
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example of an effective disabling modification would be a single nucleotide deletion 
occurring at the begi nnin g of a olfactory receptor gene that would produce a translational 
reading frameshift. Such a frameshift would disable the gene, resulting in non-expressible 
gene product and thereby disrupting functional protein production by that gene. Protease 
production by the gene could be disrupted if the regulatory regions or the coding regions of 
the protease genes are disrupted. 

In addition to disabling genes by deleting nucleotides, causing a transitional reading 
frameshift, disabling modifications would also be possible by other techniques including 
insertions, substitutions, inversions or transversions of nucleotides within the gene's DNA that 
would effectively prevent the formation of the protein coded for by the DNA. 

It is also within the capabilities of one skilled in the art to disable genes by the use of 
less specific methods. Examples of less specific methods would be the use of chemical 
mutagens such as hydroxylamine or nitrosoguanidine or the use of radiation mutagens such as 
gamma radiation or ultraviolet radiation to randomly mutate genes, such as the DOR genes of 
the present invention. Such mutated strains could, by chance, contain disabled olfactory 
receptor genes such that the genes are no longer capable of producing functional proteins for 
any one or more of the domains. The presence of the desired disabled genes could be detected 
by routine screening techniques. For further guidance, see U.S. Patent No. 5,759,538. 

Over-expression. Using the olfactory receptor genes of the present invention, it is now 
possible to selectively over-express specific DOR genes and look for changes in odor response 
and behavior. Parallel studies may be conducted in other organisms by using the olfactory 
receptor genes and the methods of the present invention to identify the olfactory receptor 
genes of other organisms and then overexpress the olfactory receptor genes of those 
organisms. 

Methods of overexpressing genes are generally known by those skilled in the art. For 
examples of producing cells which overexpress specific genes, see, for example, U.S. Patent 
Numbers 5,905,146; 5,849,999; 5,859,311; 5,602,309; 5,952,169 and 5,772,997 (HER2 
receptor). 
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Modulating or Inhibiting Expression. Using the olfactory receptor genes of the 
present invention, it is now possible to selectively modulate or inhibit specific DOR genes 
using antisense oligomers which specifically hybridize with the DNA or RNA encoding the 
DOR genes. One skilled in the art could so modulate or inhibit the expression of the DOR 
genes and detect for changes in odor response and behavior. Parallel studies may be 
conducted in other organisms by using the olfactory receptor genes and the methods of the 
present invention to identify the olfactory receptor genes in other organisms and then use 
antisense oligers to the olfactory receptor genes of those organisms. Methods for inhibiting 
expression of genes, especially genes coding for receptor genes, using antisense constructs, 
including generation of antisense sequences in situ are described, for example, in U.S. Patent 
Numbers 5,856,099; 5,556,956; 5,716,846; 5,135,917 and 6,004,814. 

Other methods that can be used to inhibit expression of an endogenous gene are 
applicable to the present invention. For example, formation of a triple helix at an essential 
region of a duplex gene serves this purpose. The triplex code, permitting design of the proper 
single stranded participant is also known in the art. (See H. E. Moser, et al, (1987) Science 
238: 645-650 andM. Cooney, et al, (1988) Science 241: 456-459). Regions in the control 
sequences containing stretches of purine bases are particularly attractive targets. Triple helix 
formation along with photocrosslinking is described, e.g., in Praseuth et al, (1988) Proc. Natl 
Acad. Sci. USA 85:1349-1353. 

Studying Behavior . The present invention is useful for studying the developmental 
aspects of the olfactory receptor genes which appear to be active at different times during 
development. Such studies may help organize the olfactory systems in various organisms and 
may help explain the behavior of various organisms. 

The tools and methodologies of the present invention may be used to study the 
influence of environmental conditions on pheromone communication. For example, newly 
identified olfactory receptor genes may be used to study the effects of different rearing 
temperatures and light regimes (selected to mimic those occurring in the spring and summer 
growing seasons) on the response of various Lepidoptera insects, such as the cabbage looper 
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moth (Trichoplnsia ni (Hubner)). For a description of the methods which might be used for 
such a study, see, for example, Grant et al, (1996) Physiol. Entomol. 21, 59-63. 

4. For Organism Detection, Monitoring and Control. 

General Pest Management . The olfactory receptor genes identified herein and 
identified using the methods of the present invention may be used to identify compounds 
which may be used for pest management. It is especially desirable to utilize various aspects of 
the present invention for pest management related to crop protection. 

The application of pheromones is now firmly established as a key component of pest 
management and control, especially within the framework of integrated pest management 
(TPM). An object of organism control is to modulate an organism's behavior or activity so as 
to reduce the irritation, sickness, or death of the host (e.g., a plant host), or to decrease the 
general health and proliferation of the organism. 

For example, the propagation of a mouse population in a given area of actual or 
potential mice infestation may be prevented or inhibited by treating such an area with an 
effective amount of male mouse pheromones, wherein such pheromones have male mouse 
aversion signaling properties (see, e.g., U.S. Patent No. 5,252,326). 

Insect Repellents and Insecticides . The present invention provides the tools and 
methodologies useful for identifying compounds which modulate insect behavior by 
exploiting the sensory capabilities of the target insect. For example, attempts have been made 
to describe and synthesize the complex interactions which underlie host-seeking behavior in 
mosquitoes. Using the methods and olfactory receptor genes of the present invention, it is 
possible to design specific compounds which target mosquito olfactory receptor genes. Thus, 
the present invention provides the ability to alter or to eliminate the orientation and feeding 
behaviors of mosquitoes and thereby have a positive impact on world health by controlling 
mosquito-borne diseases, such as malaria. 

Mosquito olfactory receptor genes may be identified and/or targeted using various 
aspects of the present invention. For example, the olfactory receptor genes of the present 
invention may be used to design probes as discussed elsewhere herein for the identification 
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and characterization of mosquito olfactory receptor genes. Alternatively, the algorithm of the 
present invention may be used to identify mosquito olfactory receptor genes in the genetic 
databases for mosquitoes. Once the mosquito olfactory receptor genes are identified, then 
various screening methods described elsewhere herein, such as the high throughput assays 
5 discussed elsewhere herein, may be used to identify synthetic and natural compounds which 
may modulate the behavior of the insect. 

Mating Enhancement and Disruption . The olfactory receptor genes identified herein 
and identified using the methods of the present invention may be used to identify compounds 
which interfere with the orientation and mating of a wide range of organisms, including 
1 0 insects. Thus, the present invention enables the identification of compositions which disrupt 
insect mating by selective inhibition of specific receptor genes involved in mating attraction 
(see, e.g., U.S. Patent No. 5,064,820). 

Animal Repellants . The olfactory receptor genes identified herein and identified using 
the methods of the present invention may be used to identify compounds which may be used 
15 as animal repellants. Such compositions may be used to repel both predatory and non- 
predatory animals (see, e.g., U.S. Patent No. 4,668,455). 

6. Organism Attraction. 

Insect Attractants . The olfactory receptor genes identified herein and identified using 
the methods of the present invention may be used to identify compounds which attract 
20 specific insects to a particular location (see, e.g., U.S. Patent No. 4,880,624 & 4,851,218). 

For example, aspects of the present invention may to used in various methods which 
reduce or eliminate the levels of particular insect pests, such as mosquitoes and tsetse flies. 
As a particular example, insect traps can be created wherein the pheromone attracts a 
particular insect, like the tsetse fly, and the insect so attracted dies in the trap. In this way, the 
25 population of tsetse flies may be reduced or eliminated in a particular area. 

The insect attractant compositions so identified may also be combined with an 
insecticide, for example as an insect bait in microencapsulated form. Alternatively, or in 
addition, the insect attractant composition may be placed inside an insect trap, or in the 
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vicinity of the entrance to an insect trap. 

In addition to killing insects, the trapping of insects is often very important for 
estimating or calculating how many insects of a particular type are feeding within a specific 
area. Such estimates are used to determine where and when insecticide spraying should be 
commenced and terminated. 

Insect traps which may be used are, for example, those as described in 
PCT/BG93/01442 and U.S. Patent No. 5,713,153. Specific examples of insect traps include, 
but are not limited to, the Gypsy Moth Delta Trap®, Boll Weevil Scout Trap®, Jackson trap, 
Japanese beetle trap, McPhail trap, Pherocon 1C trap, Pherocon II trap, Perocon AM trap and 
Trogo trap. 

Kairomones may be used as an attractancy for the enhancement of the pollination of 
selected plant species. 

Attractant compositions which demonstrate biological activity toward one sex which 
is greater than toward the opposite sex may be useful in trapping one sex of a specific 
organism over another. For example, a composition may be a highly effective attractant for 
male apple ermine moths (Yponomeuta malinellus (Zeller)) and not so effective an attractant 
for female apple ermine moths. By attracting adult males to field traps, the composition 
provides a means for detecting, monitoring, and controlling this agricultural pest (see, e.g., 
U.S. Patent No. 5,380,524). 

Attracting Predators and Parasitoids . The olfactory receptor genes of the present 
invention and the olfactory receptor genes identified using the methods of the present 
invention may also be used to identify chemicals which attract various predators and 
parasitoids. Attracting the predators and parasitoids which attack certain pests offers an 
alternative method of pest management. 

Animal Attractants . The olfactory receptor genes identified herein and those identified 
by the methods of the present invention may be used to identify chemicals which attract 
household domesticated animals. For example, a pheromone-containing litter preparation 
may attract the animals and absorb liquids and liquid-containing waste released by the 
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attracted animal (see, e.g., U.S. Patent No. 5,415,131). 

Synthetic Perfumes . A "perfume" or a "fragrance composition" is a specific 
pleasantly odorous cosmetic composition for topical application to an individual. The 
olfactory receptor genes identified herein and those identified by the methods of the present 
5 invention may be used to identify chemicals which may be produced and used as synthetic 
perfumes. Such perfumes may be used to disguise odors or enhance attraction between 
humans (see, e.g., U.S. Patent No. 5,278,141). 

7. Pharmaceuticals. The olfactory receptor genes identified herein and those 
identified using the methods of the present invention may be used to identify pharmaceutical 

1 0 compounds useful for altering the behavior and physiology of animals. Examples of such 

compounds include, but are not limited to, certain Androstene steroids that effectuate a change 
in human hypothalamic function (see, e.g., U.S. Patent No. 5,969,168). 

8. Industrial Applications. The olfactory receptor genes identified by the methods 
of the present invention may be used for a number of different industrial applications 

1 5 including, but not limited to the following : 

a) Identification of appetite suppressant compounds and using same to suppress and/or 
control appetite. 

b) Trapping odors of a specific type. 

c) As Biosensors. 

20 1) Explosive and drug detectors. The detectors may be synthetic, such as biologically- 

inspired robotic sensors, or biological sensors, such as sniffing dogs which are especially 
sensitive to certain odors. 

2) Population of olfactory receptor genes expressed in cell culture. Olfactory receptor 
genes can be introduced into a cell line and the transformed cells maintained in culture 

25 through multiple generations. By creating specific cell lines which express multiple olfactory 
genes at once, it would be possible to use such cell cultures to investigate how odorants 
interact with odorant receptor genes. Thus, the present invention provides methods for 
identifying odorant fingerprints, wherein such methods include contacting a series of cells 
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containing and expressing known odor receptor genes with a desired sample, and determining 
the type and quantity of the odorant ligands present in the sample (see, e.g., U.S. Patent No. 
5,993,778). As discussed elsewhere herein, the interaction of substances with the receptors 
can be identified using appropriate labels, such as those provided by luciferase, the jellyfish 

5 green fluorescent protein (GFP) or p-galactosidase. 

3) Biochip Arrays. As discussed elsewhere herein, biochip arrays of odorant receptor 
genes can be generated. The arrays may be used to detect olfactory receptor ligands via an 
appropriate marker or via a chemical or electrical signal. Arrays may be designed for specific 
purposes, such as, but not limited to, detecting perfumes, explosives, drugs, pollutants, and 

10 toxins. 

d) Training organisms to conduct certain tasks. Examples include, but are not limited to, 
the following: 

1) Training mice to pull guide line for stringing fiber optic cable through existing 
conduit holding copper wire. 
! 5 2) Training mice to find their way through a maze based on smell (see, e.g., Otto et al, 

(1991) Hippocampus 1, 181-192; Granger et al, (1991) Psych. Science 2, 1 16-1 18). 

3) Improving the orientation and homing performance of pigeons (see, e.g., Wiltschko, 
(1996) J. Exp. Biol. 199, 113-119) and fish (see, e.g., Cao et al. (1998) Proc. Natl. Acad. Sci. 
USA95(20):11987-11992), 
20 4) Orient or reorient the behavior of worker bees of a rearing colony by incorporating a 

composition which includes one or more pheromones which elicits particular bee behavior 
towards the larvae. Thus, the beekeeper may orient or reorient the bees towards a particular 
activity such as, but not limited to, inducing improved acceptance of the larvae at the 
beginning of rearing, to increase the production of royal jelly, regulate the feeding of the 
25 larvae as to favor the development of queen bees, etc. (see, e.g. , U.S. Patent No. 5,695,383). 

Without further description, it is believed that one of ordinary skill in the art can, using 
the preceding description and the following illustrative examples, make and utilize the 



-52 - 



Attorney Docket No. 44574-5061 

compounds of the present invention and practice the claimed methods. The following 
working examples therefore, specifically point out the preferred embodiments of the present 
invention, and are not to be construed as limiting in any way the remainder of the disclosure. 

5 EXAMPLES 

Exam ple 1: Identification of candidate olfactory receptor genes 

hi vertebrates and nematodes it is estimated that there are hundreds of olfactory 

receptor genes, widely distributed in the genome (Buck & Axel, (1991) Cell 65, 175-187; 

Troemel et al, (1995) Cell 83, 207-218). With approximately 10% of the Drosophila 
1 0 genome sequenced, it was likely that some of the Drosophila odorant receptor genes have 

been sequenced. A two-step strategy was developed to identify odorant receptor genes from 

the genomic database. First, a computer algorithm was designed to search the Drosophila 

genomic sequence for open reading frames (ORFs) from candidate odorant receptor genes. 

Second, RT-PCR was used to determine if transcripts from any of these ORFs were expressed 
1 5 in olfactory organs. Finally, in situ hybridization was used to localize expression of DOR 

genes. 

Step 1: Computer algorithm for identification of GPCR genes . The algorithm used to 
identify GPCR genes used statistical characterization of amino acid physico-chemical profiles 
in combination with a non-parametric discriminant function. The key approach is to use the 

20 information in the interplay between the local structure (transmembrane alpha helix) and the 
global structure (repeated multiple domains) and characterize this information with concise 
statistical variables. The algorithm was trained on a set of 1 00 putative GPCR sequences from 
the GPCR database (GPCRDB) at http://swifl.embl-heidelberg.de/7tm and a set of 100 
random proteins selected from the SWISSPROT database (this framing set was later 

25 expanded, but that version was not used for the genes reported in this paper). In the first step, 
three sets of descriptors were used to summarize the physico-chemical profiles of the 
sequences. These were GES scale of hydropathy (Engelman et al, (1986) Annu. Rev. 
Biophys. Biophys. Chem. 15, 321-353), polarity (Brown, (1991) Molecular Biology Labfax, 
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Academic Press), and amino acid usage frequency. For the first two of these measurements, a 
sliding window profile was employed (White, (1994) Membrane Protein Structure, Oxford 
University Press) using a kernel of 15 amino acid constant function convoluted with a 16 
amino acid Gaussian function. These profiles were then summarized with three statistics; the 

5 periodicity (characterizing the quasi-periodic presence of the transmembrane domain), average 
derivative (characterizing the abrupt change between the transmembrane domain and 
non-transmembrane domain), and the variance of the derivative (also characterizing the abrupt 
change). GES periodicity, variance of polarity derivative, polarity periodicity and amino acid 
frequency were used as the four variables and each sequence was therefore characterized by 

1 0 four variables. These four variables were used in a non-parametric linear discriminant function 
that was then optimized to separate the known GPCRs from random proteins in the training 
set. The same linear discriminant function with the scores derived from the framing set was 
then used to screen the genomic database for candidate genes. The candidate sequences were 
given significance values by an odds ratio of the GPCRs and non-GPCRs computed using the 

1 5 observed empirical distribution of the framing set. More detailed information about the 
algorithm is available at http://www.neuron.Org/cgi/content/full/22/2/327/dcl. 

The computational screens used the genomic sequence data obtained by FTP from the 
Berkeley Drosophila Genome Project (BDGP, http://www.fruitfly.org, version 6/98). First, 
the ORFs of 300 bases or longer in all six frames were identified. Next, a program written to 

20 identify GPCRs statistically by their physico-chemical profile was used to screen for candidate 
ORFs as described above. The number of possible candidates was reduced by comparing 
them to Drosophila codon usage tables (http://flybase.bio.indiana.edu, version 10). Candidate 
ORFs whose codon usage differed at a significance level of 0.0005 by the chi-square statistic 
were discarded from the candidate set. Using these screening steps, 34 candidate ORFs were 

25 obtained. 

Further analysis revealed that eight of the thirty-four candidate ORFs corresponded to 
genes of known function, for example a cyclic nucleotide-gated channel (Baumann et al, 
(1994) EMBO J. 13, 5040-5050) and these ORFs were not further analyzed. Most of the 
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remaining ORFs encoded fewer than seven predicted transmembrane domains. The genomic 
DNA surroimding each of the computer-identified ORFs was therefore examined for the 
presence of neighboring ORFs encoding additional transmembrane domains to which the 
original ORFs might be spliced. Drosophila 5' and 3' intron-exon consensus splice sequences 

5 were used in this analysis to help identify linked exons (Mount et al, (1 992) Nucleic Acids 
Res. 20, 4255-4262). This analysis yielded several genes that encoded 
seven-transmembrane-domain proteins (22A.1 and 22A.2). 

Step 2: Sequence analysis of DOR olfactory genes . To determine if these two 
candidates were part of a larger family of genes encoding seven-transmembrane-domain 

1 0 proteins, BLAST searches of the Drosophila genome database were conducted using the 

candidate gene sequences to identify related genes (Altschul et al, (1990) J. Mol. Biol. 215, 
403-410). The computer algorithms employed identified the ORFs for the second exons of 
22A.1 and 22A.2, which encode transmembrane domains 1-4. These ORFs are on the BDGP 
PI clone designated DS005342. The DS005342 sequence was examined around the initial 

1 5 ORFs for neighboring ORFs which encoded additional potential transmembrane domains. 
Key to the identification of these neighboring ORFs was the presence of intron-exon 
consensus splice sequences: GTRAGT for the 5' end and HAG for the 3' end (Mount et al, 
(1992) Nucleic Acids Res. 20, 4255-4262). 22A.1 and 22A.2 were found to have two other 
introns in corresponding locations, all of which had conserved splice sequences. 

20 The amino acid sequences of 22 A. 1 and 22A.2 were used in searches of the 

Drosophila genome database using the tBLASTn program of the BDGP. These searches 
yielded partial sequences of other members of the DOR family. To complete the sequences of 
these genes, an analysis of the genomic DNA around each identified ORF was carried out as 
was done for 22A.1 and 22A.2, using the locations of conserved introns in the genes, the 

25 intron consensus splice sequences, and the tBLASTn alignments as guides. Use of the genes 
identified in the second round as query sequences in tBLASTn searches and subsequent 
similar analysis of genomic DNA yielded the remaining genes. Additional searches of 
GenBank and SwissProt databases were performed with the NCBI (National Center for 
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Biotechnology Information) BLAST network. 

The sequence alignment in Figure 3 is based on the alignments predicted by the 
tBLASTn program of the BDGP but was edited extensively. The 5' splice sequences for the 
most 3' introns of both 2F.1 and 47E.1 were unfavorable. It was assumed that these introns 
5 were spliced nonetheless, as the resulting amino acid sequence displayed greater sequence 

identity to other DOR family members. If these introns were not spliced out, then the lengths 
of 2F.1 and 47E.1 would not be significantly altered from the lengths indicated in Figure 3. 
2F.1 was independently predicted to be a gene (GenBank accession number 2661571) by the 
EMBL genefinder program subsequent to the submission of the provisional application to 

1 0 which this application claims priority. 

Homologs of the two candidates were found, and their sequences were used in turn for 
further database searches. In total, forty-nine genes have been identified from the 
approximately 16% genomic sequence currently available. Applicants have tentatively named 
this family of genes DOR (for Drosophila Olfactory Receptor), and each individual gene was 

1 5 named based upon its cytogenetic location in the genome. Thus the two genes identified 
initially are DOR22A.1 and DOR22A.2, which were abbreviated here as 22A.1 and 22A.2 
(the final digit in this nomenclature is used to distinguish the genes at a site and does not refer 
to the cytogenetic band number). The genomic locations of all the DOR genes identified so 
far are indicated in Figure 2A, and an alignment of their amino acid sequences is presented in 

20 Figure 3. Of me forty-nme family members, the great majority have been found to be 

expressed in either the antenna or the maxillary palp, or in both, based upon RT-PCR analysis 
(Table 1) and in situ hybridizations to RNA in tissue sections. 

The DOR genes have no significant similarities to any known genes, and do not 
appear in any of the Drosophila EST databases. However, Kyte-Doolittle hydropathy plots of 

25 the predicted proteins show that each has approximately seven peaks that could represent 

transmembrane domains (Figure 2C) (Kyte & Doolittle, (1982) J. Mol. Biol. 157, 105-132). 
The lengths of the sixteen proteins are between 369 and 403 amino acids, similar to the 
lengths of most previously described families of GPCRs (Probst et al, (1992) DNA Cell Biol. 
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11, 1-20). In addition, the spacing of the putative transmembrane domains gives rise to 
predicted intracellular and extracellular loops similar in size to those in many families of 
GPCRs (Probst et al, (1992) DNA Cell Biol. 11, 1-20). 

Amino acid sequence identity among the DOR genes ranges from approximately 
5 10-75%, with many genes showing a relatively low level of identity to each other 

(approximately 20%). Two pairs of clustered genes, 22A.1/22A.2 and 33B.1/33B.2 show the 
highest identity, with 75% and 57% homology, respectively. However, not all clustered genes 
show high degrees of similarity. 33B.3, for example, is only 28% identical to both 33B.1 and 
33B.2 and 46F.1 and 46F.2 are only 29% identical. In addition to exhibiting sequence 

1 0 identity, many of the genes contain introns in corresponding locations (Figure 3), consistent 

with their constituting a family derived from a common ancestral gene. Examples of genomic 
DNA encoding the complete structural gene for DOR proteins containing the introns can be 
found in SEQ ID NO: 99-1 14, while the corresponding cDNA containing the intact ORF can 
be found in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 and 31. 

1 5 There are sixty-seven residues that are conserved among at least 50% of the genes, 

and most of these (49) are in the C-terminal halves of the proteins (Figure 3). Among the 
conserved residues are a serine and a threonine in the intracellular C-tenninal tail, residues 
frequently conserved in this region of GPCRs (Probst et al, (1992) DNA Cell Biol. 1 1, 1-20). 
The most divergent region in the sequences is a stretch of thirty amino acids representing part 

20 of the first extracellular loop and nearly all of transmembrane domain three. The divergence 
in this region also occurs in the most conserved pairs of genes: 22A. 1 and 22A.2 are 75% 
identical overall, but only 50% identical in this region, and 33B.1 and 33B.2 are 57% identical 
overall, but only 33% identical in this region. This divergence has also been observed in other 
species. In particular, transmembrane domains three, four and five were exceptionally 

25 divergent in rat odorant receptors and have been proposed to play a role in odorant binding 
(Buck & Axel, (1991) Cell 65, 175-187). 

Some of the genes are clustered in the genome (Figure 2A), while others are 
apparently isolated. Within a cluster the average intergenic distance is on the order of 500 
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bases. Clustered DOR genes do not necessarily have introns in corresponding locations (e.g. 
46F.1 and 46F.2), but all clustered genes have their transcriptional orientations in the same 
direction (Figure 2A). At least one of the DOR genes (2F.1) is flanked closely on both sides 
by two apparently unrelated genes (Figure 2B) (Haenlin et al, (1987) EMBO J. 6, 801-807). 

5 A novel strategy to search the Drosophila genomic sequence database for genes 

encoding potential GPCRs was employed, leading to the identification of a multigene family 
with properties expected of odorant receptors. In addition to these genes, a wide variety of 
other transmembrane proteins were identified by this strategy, a few previously identified by 
other means and many representing novel proteins with similarity to known transmembrane 

1 0 proteins. These results suggest that the algorithm may be of widespread use in identifying 
new receptors, channels, and other transmembrane proteins. 

The family of candidate odorant receptor genes currently contains forty-nine members, 
identified from the 16% of the Drosophila genomic sequence that is available. By 
extrapolation the size of this family may be on the order of 100 genes, making it the largest 

1 5 gene family identified in Drosophila . 

There are several lines of evidence indicating that these genes encode Drosophila 
odorant receptors. First, the predicted proteins encoded by the genes each contain 
approximately seven potential transmembrane domains, as expected of GPCRs. Second, 
genes are expressed in one or both of the two olfactory organs, and for a number of genes this 

20 expression is restricted to a subset of olfactory receptors, as expected for odorant receptors. 

Third, the large number of family members, and the clustered location of many of these genes 
in the Drosophila genome, is reminiscent of odorant receptors in other organisms. 

Additional lines of evidence is available which indicates DOR proteins as odor 
receptors. First, antibodies raised against the product of the DOR22A.2 gene label a small 

25 number of sensilla on the fly's antenna whose location corresponds to the same region labeled 
by in situ hybridization. Most important, staining appears localized to the cavities of the 
labeled sensilla, where the dendritic cells are located. This is exactly the localization expected 
of an odorant receptor. Second, different DOR genes are expressed (as determined by in situ 



-58- 



Attorney Docket No. 44574-5061 

hybridization) in different subsets of olfactory receptor neurons, as expected of odor receptor 
genes. Third, as expected, the number of olfactory receptor neurons labeled by individual 
DOR genes corresponds with the number of olfactory receptor neurons exhibiting a particular 
odor-sensitivity because the number of neurons expressing a particular DOR gene is predicted 
5 to equal the number of neurons with a particular odor response spectrum. Finally, many of the 
DOR genes are not expressed in the Acj6 POU-domain transcription factor mutant, where a 
subset of olfactory receptor neurons displayed abnormal odorant specificities. A correlation 
between DOR gene expression and odorant-specificity therefore exists, as is expected with 
odorant receptor genes. 

1 o Comparison of the sequences of these candidate odorant receptors to those from other 

organisms shows that they are extremely divergent from known odorant receptors and other 
GPCR families. This is not surprising, as searches for these genes based on sequence 
similarity to odorant receptors from other organisms had not succeeded, and the odorant 
receptor families in vertebrates and C. elegans are essentially unrelated. There is a great deal 

1 5 of sequence divergence among the DOR genes, much more than among the rat sequences 
previously reported (Buck & Axel, (1991) Cell 65, 175-187), for example. Moreover, 
genomic Southern blots have shown that none of nine DOR genes tested defines a subfamily 
of more than two or so well-conserved genes. The DOR family therefore differs in this 
respect from the mouse family, for example, where most odorant receptor genes belong to 

20 subfamilies of approximately seven to ten genes (Ressler et al, (1993) Cell 73, 597-609). 

Although at present the clusters of DOR genes identified thus far contain smaller 
numbers of genes (less than three) than in other organisms (Troemel et al, (1995) Cell 83, 
207-218; Sullivan et al, (1996) Proc. Natl. Acad. Sci. USA 93, 884-888; Barth etal, (1997) 
Neuron 19, 359-369), a number of interesting features of the clustered genes are already 

25 apparent. As found in other organisms (Barth et al, (1997) Neuron 19, 359-369), Drosophila 
odorant receptor genes within a cluster are not necessarily coordinately regulated, such that 
genes within a cluster are expressed in different classes of cells, and even in different olfactory 
organs (e.g. 46F.1 is expressed in the maxillary palp whereas 46F.2 is expressed in the 
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antenna). So far, all genes identified within a cluster, however, are transcribed in the same 
orientation. Genes within a cluster sometimes do, but sometimes do not, share intron 
positions, suggesting that introns may have become lost following gene duplication; a 
phylogenetic study revealed extensive gene duplication and intron loss among the 

5 chemoreceptor genes of C. elegans (Robertson, (1998) Genome Res. 8, 449-463). 

Step 3: Identification of olfactory receptor genes using RT-PCR . RT-PCRwith 
primers designed from two of these final candidates yielded amplification products from 
antennal cDNA. From RT-PCR experiments, the two genes did not appear to be expressed in 
the maxillary palp, abdomen, thorax, or head from which olfactory organs had been removed, 

1 0 suggesting that these genes were expressed specifically in the antenna. These two genes are 
located within 500 base pairs of each other at cytological position 22A (Figure 2A), and their 
predicted proteins are 75% homologous at the amino acid level. 

For preparation of RNA, individual flies were frozen in liquid nitrogen, and antennae 
and maxillary palps were dissected. On average 150 antennae or 200 maxillary palps were 

1 5 used for RNA preparation. Total RNA was prepared as described elsewhere (McKenna etal, 
(1994) J. Biol. Chem. 269, 16340-16347). The RNA was treated with DNasel (Gibco-BRL) 
for thirty minutes at 37°C, phenol/chloroform extracted, and precipitated. The entire RNA 
preparation was used for oligo dT-primed cDNA synthesis using Superscript II Reverse 
Transcriptase (Gibco-BRL) according to the manufacturer's directions. PCR was performed 

20 using Taq polymerase (Sigma) under standard cycling conditions, with an annealing 
temperature of 60°C, gene-specific primer concentration of 1 pM, and magnesium 
concentration of 2.5 mM. For all genes except 2F.1, primer pairs which span introns were 
used in order to distinguish PCR bands amplified from cDNA from those amplified from any 
remaining genomic DNA. 

25 

Example 2: Hybridization of DOR gene probes to related sequences 

To determine whether any of the DOR genes have closely related homologs, coding 
regions from nine of the genes were used to probe Southern blots of Drosophila genomic 
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DNA at high or low stringency. For the closely related genes such as 22A.1 and 22A.2, a 
combined probe was used. For genomic southern blots, hybridizations were at 65°C (high 
stringency) or 55°C (low stringency), in 7% SDS, 0.5 M sodium-phosphate buffer pH 7.2, 1 
mM EDTA, pH 8.0. 

5 Each probe detected only its own sequence at high stringency, while at low stringency 

most gene probes detected one or two novel bands (data not shown). As expected, because of 
the overall low level of similarity, none of these extra bands corresponded to any of the other 
known DOR genes. These data indicate that some of these genes have one or two closely 
related homologs, but that none belongs to a large subfamily of highly related genes. 

10 

Exam ple 3: Localization of DOR gene expression 

Olfactory receptor neurons of the adult fly are located in both the antenna and the 
maxillary palp. To ask whether any of the DOR genes are expressed in these neurons, in situ 
hybridization was carried out using adult tissue sections. 

1 5 For in situ hybridization experiments, coding regions of the DOR genes were 

subcloned into the pGEM-T Easy vector (Promega). Digoxygenin-labeled RNA probes were 
generated and hydrolyzed according to the manufacturer's instructions (Boehringer 
Mannheim). In situ hybridizations to RNA in tissue sections were performed using a 
modified version of procedures described elsewhere (Roberts, (1998) Drosophila: A Practical 

20 Approach, Oxford University Press; Chadwick & McGinnis, (1 987) EMBO J. 6, 779-789). 

Briefly, heads were dissected from animals and fixed in 4% paraformaldehyde/PBS for fifteen 
minutes. Tween-20 was then added to 0.1% and heads were fixed for an additional thirty 
minutes. Samples were washed twice for five minutes in 0.1% Tween 20/PBS (PBST), cut 
into 8 um frozen sections, and mounted on poly-L-Lysine treated slides (Sigma). Sections 

25 were dried onto slides for thirty minutes at room temperature and then fixed for an additional 
thirty minutes in 4% paraformaldehyde/PBST. Samples were washed for a total of two hours 
in PBST with five changes of buffer, followed by an incubation for five minutes in 1 : 1 
PBST:hybridization buffer (50% formamide, 5x SSC, 50 mg/ml heparin, 0.1% Tween 20), 
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and then prehybridized for two hours at 55°C. 

Of eleven genes examined, seven displayed detectable expression, which in every case 
was restricted to the olfactory organs (Table 2). The 46F.1 probe hybridized to a subset of 
olfactory receptors in the maxillary palp (Figure 4A). Counting of labeled olfactory receptors 

5 in serial sections revealed that the total number of 46F. 1 -staining olfactory receptors per 

maxillary palp was 18±1 (Table 2), or 15% of the 120 olfactory neurons in the maxillary palp. 
A similar number of neurons, 17±1, was labeled by another probe, 33B.3 (Figure 4B). The 
neuronal identity of the labeled cells was apparent from the presence in many cases of a 
well-defined axon projecting from the labeled cell body and joining the maxillary nerve 

1 0 (Figures 4B-C). For both probes, the labeled neurons were distributed broadly over the 

olfactory surface of the organ, and were interspersed among unlabeled neurons (Figures 4A- 
C). Staining in many cells appeared annular, which was interpreted to reflect a perinuclear 
distribution ofmRNA, as expected of an mRNA present at highest concentrations in the cell 
bodies of these olfactory receptors (Figure 4B). The 33B.3 and 46F.1 genes are evidently 

1 5 expressed in different subsets of olfactory receptors, because the number of neurons 

hybridizing with a mixed probe was greater than the number of neurons that hybridized when 
either probe was used individually (data not shown). No hybridization detected in the 
antenna, head, or thorax for either probe. 

Many of the DOR genes are expressed in the antenna and not in the maxillary palp, as 

20 determined by RT-PCR (Table 1). For several genes this localization was confirmed by in 
situ hybridization. The 47E. 1 probe hybridized to 40±1 cells in a broad area across the 
antenna (Figures 5A-B), including both anterior and posterior faces, similar to the distribution 
pattern of small s. basiconica (Figure IF). A probe from the 25 A. 1 gene hybridized to fewer 
cells, 16±1, but in aregionofthe antenna similar to that of 47E.1 staining, as judged by 

25 reconstruction of serial sections (Figure 5C-D). The 22A.2 probe hybridized to 22±1 cells in a 
different distribution, clustered in the dorso-medial region of the antenna (Figure 5E). This 
pattern matches the distribution of the large s. basiconica (Figure IE). The expression 
patterns of the three genes in the antenna are illustrated schematically in Figure 5G. None of 
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these three probes revealed expression in the maxillary palp, head, or thorax. This data 
demonstrates that the DOR family is expressed in olfactory receptors, and that the expression 
of individual members is restricted to distinct subsets of cells in the olfactory organs. 

The number and broad distribution of maxillary palp neurons expressing 46F.1 and 
33B.3 are intriguing in light of electrophysiological studies. There are approximately 120 
olfactory receptors on the palp, which fall into six different classes based upon their odorant 
response profiles. Each class contains roughly equal numbers of neurons, distributed broadly 
over the olfactory surface of the palp. Thus, if an individual receptor gene is expressed in all 
olfactory receptors of a functional class, one might expect a gene to be expressed in a broad 
distribution, in approximately twenty neurons, in good agreement with the distribution and 
numbers observed for both 46F.1 and 33B.3 (18±1 and 17±1, respectively). 

The two DOR genes whose expression was detected by in situ hybridization in the 
maxillary palp are expressed in olfactory receptors housed within s. basiconica, the only 
morphological class of sensilla on the palp. In the antenna, the 22A.2 probe consistently 
hybridized to a subset of cells in a portion of the dorso-medial region of the antenna that 
contains almost exclusively large s. basiconica (Figure IE). The 47E.1 and 25 A. 1 probes 
hybridize to subsets of cells in a distinctly different region of the antenna which may correlate 
with the distribution of small s. basiconica, of which at least two functional types are 
mterrningled (Figure IF). Of particular interest, the numbers of cells to which 47E.1 and 
25A.1 hybridize are different: 40±1 and 16±1; one possible interpretation is that they are 
expressed in distinct functional types of small s. basiconica. This region also contains s. 
trichodea and s. coeloconica, and although the labeling patterns do not correlate with the 
distribution of either of two functional classes of s. trichodea (Clyne et ah, (1997) Invert. 
Neurosci. 3, 127-135), a definitive identification of the sensillar type may require further 
investigation. If in fact all the DOR genes are expressed in only one of the morphological 
categories of sensilla, the s. basiconica, it is possible that there are other, as yet unidentified, 
families of receptors that are expressed in the other morphological categories of sensilla. This 
would mean that the number of odorant receptors in Drosophila might be substantially larger 
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than one-hundred. 

Applicants have identified three DOR genes that are expressed in the maxillary palp 
(Table 1), from the 16% of the genome analyzed. As these three genes, like most DOR genes, 
are not clustered in the genome, linear extrapolation suggests that the entire genome contains 

5 on the order of eighteen DOR genes expressed in the maxillary palp, an organ which has six 
functional classes of neurons (Clyne et al, (1999) Neuron 22, 339-347; de Bruyne et al, 
(1999) J. Neurosci. 19, 4520-4532). If all neurons within a functional class, i.e. with the same 
odor-specificity, are identical in terms of their receptor expression, then the ratio of expressed 
genes to neuronal classes in this organ would be consistent with a model in which an 

1 0 individual ORN expresses a small number of odorant receptors; however, further data is 

needed to establish conclusively the number of receptor genes expressed per cell. Olfactory 
neurons in other organisms appear to lie at either of two extremes: in the vertebrates, it is 
believed only one receptor is expressed per ORN (Ngai et al, (1993) Cell 72, 667-680; 
Ressler et al, (1993) Cell 73, 597-609; Vassar et al, (1993) Cell 74, 309-318); in C. elegans, 

1 5 approximately 550 chemoreceptors are likely to be distributed amongst fourteen classes of 
chemosensory neurons (Troemel et al, (1995) Cell 83, 207-218). 

Olfactory receptors in Drosophila and other insects project to an olfactory processing 
center, the antennal lobe, which is much like the olfactory bulb of vertebrates. Like its 
vertebrate counterpart, the antennal lobe contains olfactory glomeruli, of which the antennal 

20 lobe of Drosophila has approximately forty (Stacker et al, (1995) Roux's Arch Dev Biol 205, 
62-72; Laissue et al, (1999) J. Comp. Neurol. 405, 543-552). In vertebrates there is an 
approximate equivalence between the estimated number of odorant receptor genes and the 
number of glomeruli (Barth et al, (1996) Neuron 16, 23-34; Buck, (1996) Annu. Rev. 
Neurosci. 19, 517-544); since C. elegans does not contain glomeruli, it has not been possible 

25 until now to consider whether the evolutionary conservation of this equivalence extends to 
invertebrates. If in fact the number of DOR genes is one-hundred, then the ratio of odorant 
receptor genes to glomeruli would exceed two, and would rise if additional families of odorant 
receptor genes were discovered. Of particular interest, the number of glomeruli receiving 
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input from the maxillary palp has been variously estimated as three and five (V enkatesh & 
Singh, (1984) Int. J. Insect. Morphol. Embryol. 13, 51-63; Stacker et al, (1995) Roux's Arch 
Dev Biol 205, 62-72); if our estimate of eighteen genes expressed in the maxillary palp is 
correct, then the ratio of these receptor genes to their corresponding glomeruli would fall in 
the range of three to six. 

Example 4: DOR gene expression during d evelopment 

Recent evidence supports a dual role for the vertebrate olfactory receptors. First, these 
receptors have an instructive role in guiding the axons of olfactory receptors to the correct 
glomeruli during development (Mombaerts et al, (1996) Cell 87, 675-686; Wang et al, 
(1998) Cell 93, 47-60), and second as odorant receptors in the adult (Zhao et al, (1998) 
Science 279, 237-242). To address the possibility that the DOR genes might also play a role 
in development, three DOR probes were hybridized to antennal sections from different stages 
of pupal development. In Drosophila, ORN axons first leave the developing antenna at 
approximately sixteen hours after puparium formation (APF) (Lienhard & Stacker, (1991) 
Development 1 12, 1063-1075; Ray & Rodrigues, (1995) Dev. Biol. 167, 426-438; Reddy et 
al, (1997) Development 124, 703-712), and the diameter of the antennal nerve continues to 
increase until 72 hours APF (Stacker et al, (1995) Roux's Arch. Dev. Biol. 205, 62-72). 
Glomeruli first become visible in the antennal lobe at approximately 48 hours APF. 
Developing antennae were therefore examined at 16, 24, 36, 48, 54, 60, 72 and 93 hours APF 
(adults eclosed from the pupal case at approximately 100 hours). For these developmental 
studies, Drosophila were collected as white prepupae and kept at 25°C on moist filter paper 
for the indicated number of hours, at which time they were fixed. At 25°C the approximate 
time from the white prepupal stage to eclosion is 100 hours (Lockett & Ashburner, (1989) 
Dev. Biol. 134, 430-437). 

Cells positive for 22A.2 were first seen at 60 hours APF, indicating that detectable 
expression begins between 54 and 60 hours, well within the period in which the antennal 
nerve is still increasing in diameter (Figure 6A-B). A subset of cells was labeled at this time, 
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and they were restricted to a subregion of the developing antenna; the pattern appears 
comparable to that of the mature antenna, although this pattern was not characterized in as 
much detail as that of the adult. Labeling with 22A.2 was also observed in antennae at all 
subsequent time points. Interestingly, cells positive for 47E.1 and 25 A. 1 were not observed 
until much later, at the 93 hour time point; they were not observed at any of the earlier times 
(Figure 6C-D and data not shown). For comparison, in situ hybridization was also performed 
with a probe representing the odorant-binding protein OS-E (McKenna et al, (1994) J. Biol. 
Chem. 269, 16340-16347), which is believed to play a role in olfactory function, but which 
has not been implicated in a developmental process. OS-E was also first observed at 93 hours, 
at which time it expression increased (Figure 6E-F). 

Example 5: Regulation of DOR expression bv POT I domain tr anscription factor aa'6 

Little is known about the regulation of odor receptor genes, a process critical to the 
establishment of olfactory neuron identity and ultimately to the process of olfactory coding. 
In C. elegans the odr7 gene, a member of the nuclear receptor superfamily, has been shown to 
regulate the odorant receptor gene odrlO (Sengupta et al, (1994) Cell 79, 971-980; Sengupta 
et al, (1996) Cell 84, 899-909). In Drosophila, null mutations of the acj6 gene, which 
encodes a POU domain transcription factor, eliminate the odor response of three of the six 
classes of maxillary palp olfactory receptors (Clyne et al, (1999) Neuron 22, 339-347). A 
fourth ORN class on the maxillary palp is altered to a new class of ORN with a novel odor 
sensitivity. These data suggest that Acj6 plays a role in the differentiation of certain maxillary 
palp olfactory receptors, perhaps by determining which olfactory receptor gene(s) are 
expressed. To address the possibility that Acj6 regulates odorant receptor genes, probes from 
the 33B.3 and 46F.1 genes were hybridized to sections of maxillary palps from the null 
mutant, acj6 6 . No hybridization was detected in either case (Figure 4D and data not shown), 
nor was expression of either gene detected by RT-PCR from acj6 6 maxillary palps (Table 1). 

acj6 mutations also affect the physiological response of the antennal neurons to odors 
(Ayer & Carlson, (1991) Proc. Nat. Acad. Sci. USA 88, 5467-5471; Ayer & Carlson, (1992) 
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J. Neurobiol. 23, 965-982). 22A.2, 25A.1, and 47E.1 probes were therefore hybridized to 
sections of acj6 6 antennae. All three probes hybridized to groups of cells in the same locations 
as in the wild type antenna (Figure 5F and data not shown). RT-PCR amplification showed 
that expression of certain other DOR genes, 33B.1, 33B.2, 33B.3, and 46F.2 was eliminated 
5 in the antenna of acj6 6 (Table 1). Thus, in the acj6 6 mutant, one subset of candidate odorant 
receptor genes was not expressed while a different subset remained unaffected. Interestingly, 
genes within a cluster all showed similar dependency on Acj6: 33B.1, 33B.2, and 33B.3, for 
example, all depended on Acj6, whereas 22A.1 and 22A.2 did not. In summary, these data 
support a role for acj6 in the regulation of a subset of olfactory receptor genes. 
10 The DOR family is subj ect to complex regulation. First, the expression of individual 

DOR genes exhibits highly specific tissue and spatial localization. Some genes are expressed 
in the antenna but not the maxillary palp; others show expression in the maxillary palp but not 
the antenna. Within an organ, expression of a particular DOR gene is restricted to a subset of 
cells. In the antenna, the patterns of expression are spatially regulated, exhibiting regional 
1 5 specificity of expression as detailed above. In the maxillary palp, expression is limited to a 
population of neurons approximately equal in number to the neurons of a functional class. 

DOR genes are also subject to interesting temporal regulation. One gene, 22A.2, is 
expressed in the developing antenna during a time when the antennal nerve is still increasing 
in diameter (Stacker et al, (1995) Roux's Arch. Dev. Biol. 205, 62-72) . These data leave 
20 open a possible role for Drosophila olfactory receptors in axon guidance and glomerulus 

formation, a role for which evidence has been found in vertebrates (Mombaerts et al, (1996) 
Cell 87, 675-686; Wang et al, (1998) Cell 93, 47-60) but not C. elegans. In zebrafish, 
odorant receptors show asynchronous onset of expression during development of the olfactory 
placode (Barth et al, (1996) Neuron 16, 23-34). The DOR genes also show heterogeneity in 
25 their temporal regulation: expression of two other DOR genes begins much later than for the 
22A.2 gene. If in fact individual olfactory receptors express more than one DOR gene, 
perhaps some have acquired a specialized role in development. 

Evidence also exists indicating that different DOR genes are expressed at different 
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levels of abundance within cells. Although RT-PCR experiments demonstrated expression of 
25 A. 1 in both antenna and maxillary palp, in situ hybridization revealed expression of 25 A. 1 
only in the antenna of each animal examined; conversely, although RT-PCR experiments 
showed expression of 33B.3 in both olfactory organs, in situ hybridization detected label only 
5 in the maxillary palp of each animal examined (Tables 1 and 2). These results suggest that a 
receptor gene may be expressed at different cellular levels in the two organs, and that different 
genes may be expressed at different cellular levels in the same organ. Such an explanation 
would suggest that there are mechanisms governing not only the spatial and temporal control 
of DOR genes, but also their levels of expression. 
1 o If DOR genes are in fact expressed at different cellular levels in particular olfactory 

receptors, then perhaps the four DOR genes that were undetectable in the antenna by in situ 
hybridization, despite clear evidence for their antennal expression from RT-PCR, a more 
sensitive technique, are among those expressed at low levels. It is important to note that in C. 
elegans, expression of a number of candidate odorant receptors was undetectable using GFP 
15 fusion genes (Troemel et ah, (1995) Cell 83, 207-218). 

As a first step in investigating the mechanisms through which the complex regulation 
of DOR genes is achieved, the role of the POU domain transcription factor Acj6 was tested, 
which was previously found to act in governing olfactory neuron identity. Applicants found 
that Acj6 is in fact required for expression of the DOR family. Two lines of evidence, 
20 RT-PCR and in situ hybridization analysis, both indicate that proper expression of a specific 
subset of DOR genes depends on Acj6. The results indicate that the odor-specificity of a 
subset of olfactory receptors is governed at least in part by the action of the Acj6 POU domain 
transcription factor on DOR genes, and are fully consistent with the notion that DOR genes 
encode odorant receptors. 
25 The isolation of genes likely to encode odorant receptors in Drosophila opens a 

number of avenues for future investigation. Drosophila provides the ability to manipulate 
odor receptors genetically and test the functional consequences of such manipulations in vivo, 
either physiologically or behaviorally. Such analysis may be useful in examining potential 
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roles of DOR proteins in olfactory response and in development. It may also be possible to 
isolate homologous genes in other insects, including some which provide excellent 
opportunities for research and some of agricultural or medical importance which rely on 
olfactory cues to locate their hosts. 

5 

Example 6: Transgenic Drosophila 

P element mediated germline transformation of Drosophila can be carried out as 
previously described (Rubin & Spradling, (1982) Science 218, 348-353). Drosophila 
embryos are isolated and microinjected with P element expression constructs as previously 

1 0 described (Karess & Rubin, (1984) Cell 38, 135-146) containing a particular DOR nucleotide 
sequence, at 0.5 mg/ml together with a helper plasmid at 0.1 mg/ml. Go injected adults are 
individually back crossed to the recipient strain and the Gi progeny screened for the w+ 
transformation marker (Klemenz et al, (1987) Nucleic Acids Res. 10, 3947-3959). 
Transformed lines homoaygous for the rransgene are established from orange eyed Gi flies as 

15 previously described (Klemenz et al, (1987) Nucleic Acids Res. 10, 3947-3959). 

A line of Drosophila in which the DOR33B.3 gene can be over-expressed was 
constructed as described above. The DOR33B.3 coding sequences were joined to an 
upstream activating sequence (UAS) and introduced by P element-mediated germline 
transformation into Drosophila. A yeast GAL4 transcription factor gene, coupled to a heat 

20 shock promoter, was then crossed into the transgenic line. As expected, heat shock of this line 
resulted in induction of DOR33B.3 expression. The heat shock-induced expression of GAL4, 
results in binding of GAL4 to the UAS, and subsequent induction of DOR33B.3 expression. 
This transgenic line of Drosophila, and three other transgenic lines containing other DOR 
genes, can be tested for elevated responses to any of fifty different odors. Elevated response 

25 to any particular odorant is indicative of an ligand which binds and activates the over- 
expressed receptor (see, e.g., Zhao & Firestein, (1998) Science 279, 237-242). 



Although the present invention has been described in detail with reference to 
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examples above, it is understood that various modifications can be made without departing 
from the spirit of the invention. Accordingly, the invention is limited only by the following 
claims. All cited patents and publications referred to in this application are herein 
incorporated by reference in their entirety. The results of the experiments disclosed herein 
have been published in the journal Neuron (22, 327-338) in February, 1999, this article herein 
incorporated by reference in its entirety. 
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We claim: 

1 . An isolated nucleic acid molecule selected from the group consisting of: 

a) an isolated nucleic acid molecule that encodes the amino acid sequence of a 
Drosophila Odorant Receptor protein; 

b) an isolated nucleic acid molecule that encodes a protein fragment of at least 6 amino 
acids of a Drosophila Odorant Receptor protein; and 

c) an isolated nucleic acid molecule which hybridizes to a nucleic acid molecule 
comprising a nucleotide sequence encoding a Drosophila Odorant Receptor protein under 
conditions of sufficient stringency to produce a clear signal. 

2. The isolated nucleic acid molecule of claim 1 wherein the nucleic acid comprises at 
least one exon-intron boundary located in a position selected from the group consisting of: 

a) the nucleotides encoding the amino acids which comprise the third extracellular 
domain of a Drosophila Odorant Receptor protein; 

b) the nucleotides encoding the amino acids which comprise the fourth extracellular 
domain of a Drosophila Odorant Receptor protein; and 

c) the nucleotides encoding the amino acids which comprise the fourth intracellular 
domain of a Drosophila Odorant Receptor protein. 

3. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule is 
selected from the group consisting of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 
77, 79, 81, 83, 85, 87, 89, 91, 93, 95 and 97. 

4. The isolated nucleic acid molecule of any one of claims 1 -3, wherein said nucleic 
acid molecule is operably linked to one or more expression control elements. 



-71 - 



Attorney Docket No. 44574-5061 

5. A vector comprising an isolated nucleic acid molecule of any one of claims 1-3. 

6. A host cell transformed to contain the nucleic acid molecule of any one of 
claims 1-3. 

7. A host cell comprising a vector of claim 5. 

8. A host cell of claim 7, wherein said host is selected from the group consisting of 
prokaryotic hosts and eukaryotic hosts. 

9. A method for producing a protein or protein fragment comprising the step of 
culturing a host cell transformed with the nucleic acid molecule of any one of claims 1-3 
under conditions in which the protein or protein fragment encoded by said nucleic acid 
molecule is expressed. 

10. The method of claim 9, wherein said host cell is selected from the group 
consisting of prokaryotic hosts and eukaryotic hosts. 

1 1 . An isolated protein or protein fragment produced by the method of claim 10. 

12. An isolated protein or protein fragment selected from the group consisting of: 

a) an isolated protein comprising one of the amino acid sequences depicted in SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 
52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98; 

b) an isolated protein fragment comprising at least 6 amino acids of any of the 
sequences depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 94, 96 and 98; 
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c) an isolated protein comprising conservative amino acid substitutions of any of the 
sequences depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 94, 96 and 98; and 

d) naturally occurring amino acid sequence variants of any of the sequences depicted 
in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 
46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 
96 and 98. 

13. The isolated protein or protein fragment of claim 12 wherein the protein or protein 
fragment has at least one of the following conserved amino acids selected from the group 
consisting of: 

a) Leucine in the third extracellular domain of a Drosophila Odorant Receptor protein; 

b) Histidine in the third extracellular domain of a Drosophila Odorant Receptor 
protein; 

c) Cysteine in the sixth transmembrane domain of a Drosophila Odorant Receptor 

protein; 

d) Tryptophan in the fourth extracellular domain of a Drosophila Odorant Receptor 

protein; 

e) Glutarnine in the seventh transmembrane domain of a Drosophila Odorant Receptor 

protein; 

f) Proline in the seventh transmembrane domain of a Drosophila Odorant Receptor 

protein; 

g) Alanine in the fourth intracellular domain of a Drosophila Odorant Receptor 
protein; and 

h) Tyrosine in the fourth intracellular domain of a Drosophila Odorant Receptor 

protein. 
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14. An isolated antibody that binds to a polypeptide of claim 1 1, 12 or 13. 

15. The antibody of claim 14 wherein said antibody is a monoclonal or polyclonal 
antibody. 

5 

16. A method of identifying an agent which modulates the expression of a protein or 
protein fragment of claim 11, 12 or 13 comprising the steps of: 

a) exposing cells which express the protein or protein fragment to the agent; and 

b) determining whether the agent modulates expression of said protein or protein 

1 0 fragment, thereby identifying an agent which modulates the expression of a protein or protein 
fragment of claim 11, 12 or 13. 

17. A method of identifying an agent which modulates the activity of a protein or 
protein fragment of claim 1 1, 12 or 13 comprising the steps of: 

15 a) exposing cells which express the protein or protein fragment to the agent; and 

b) determining whether the agent modulates the activity of said protein or protein 
fragment, thereby identifying an agent which modulates the activity of a protein or protein 
fragment of claim 11, 12 or 13. 

20 18. The method of claim 17, wherein the agent modulates at least one activity of the 

protein or protein fragment. 

19. A method of identifying an agent which modulates the transcription of the nucleic 
acid molecule of any one of claims 1-3 comprising the steps of: 
25 a) exposing cells which transcribe the nucleic acid to the agent; and 

b) determining whether the agent modulates transcription of said nucleic acid, thereby 
identifying an agent which modulates the transcription of the nucleic acid molecule of any one 
of claims 1-3. 
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20. A method of identifying binding partners for a protein or protein fragment of 
either claim 1 1, 12 or 13 comprising the steps of: 

a) exposing said protein or protein fragment to a potential binding partner; and 

b) determining if the potential binding partner binds to said protein or protein 
5 fragment, thereby identifying binding partners for the protein or protein fragment. 



21. A method of modulating the expression of a nucleic acid encoding a protein or 
protein fragment of claim 11, 12 or 13 comprising administering an effective amount of an 
agent which modulates the expression of a nucleic acid encoding the protein or protein 
10 fragment. 



22. A method of modulating at least one activity of a protein or protein fragment of 
claim 11, 12 or 13 comprising the step of administering an effective amount of an agent 
which modulates at least one activity of the protein or protein fragment. 

15 

23. A method of identifying novel olfactory receptor genes comprising the steps of: 

a) selecting candidate olfactory receptor genes by screening a nucleic acid database 
using an algorithm trained to identify seven transmembrane receptors genes; 

b) screening said selected candidate olfactory receptor genes by identifying nucleic 
20 acid sequences with conserved amino acid residues and intron-exon boundaries common to 

olfactory receptors, and having open reading frames of sufficient size so as to encode a seven 
transmembrane receptor; and 

c) identifying the novel olfactory receptor genes and measuring the expression of 
olfactory receptor genes wherein the detection of expression confirms said candidate olfactory 

25 gene as an olfactory gene. 



24. A method of identifying novel olfactory receptor genes comprising the steps of: 
a) selecting candidate olfactory receptor genes by screening a nucleic acid database for 
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nucleic acid sequences with sufficient homology to at least one known olfactory receptor 
gene; 

b) screening said selected candidate olfactory receptor genes by identifying nucleic 
acids with conserved amino acid residues and intron-exon boundaries common to olfactory 

5 receptors, and having open reading frames of sufficient size so as to encode a seven 
transmembrane receptor; and 

c) identifying the novel olfactory receptor genes and measuring the expression of 
olfactory receptor genes wherein the detection of expression confirms said candidate olfactory 
gene as an olfactory gene. 

10 

25. A transgenic insect modified to contain a nucleic acid molecule of any of claims 

1-3. 

26. The transgenic insect of claim 25, wherein the nucleic acid molecule contains a 
1 5 mutation that alters expression of the encoded protein. 
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ABSTRACT 

The present invention provides nucleic acids and amino acids for novel olfactory 
receptors as well as methods for identifying olfactory receptors. More specifically, the 
present invention provides nucleic acids and amino acids for novel olfactory receptors in 
5 Drosophila as well as methods of using the provided nucleic acids and amino acids. In 
addition, this invention provides methods of identifying ligands which bind to the novel 
olfactory receptors as well as a variety of methods for using the ligands so identified. 
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SEQUENCE LISTING 



<110> Yale University- 
Carlson, John R. 
Kim, Hunhyong 
Clyne, Peter J. 
Warr, Coral G. 

<120> Novel Family of Odorant Receptor Genes in Drosophila 

<130> 44574-5061-WO 

<140> 
<141> 

<150> US 60/117,132 
<151> 1999-01-25 

<160> 112 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 767 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (765) 
<223> DOR 22A.2, coding 
AF127924 

<400> 1 

atg tta age cag ttc ttt 
Met Leu Ser Gin Phe Phe 
1 5 

egg gtt aag tec cga gat 
Arg Val Lys Ser Arg Asp 
20 

tec ttt ggc tgg aca gtg 
Ser Phe Gly Trp Thr Val 
35 

aaa ctg tgg tea act ttc 
Lys Leu Trp Ser Thr Phe 
50 



region of NCBI Acc< 



ccc cac att aaa gaa aag cca ttg age gag 
Pro His lie Lys Glu Lys Pro Leu Ser Glu 



gee ttc gtt tac tta gat egg gtg atg tgg 
Ala Phe Val Tyr Leu Asp Arg Val Met Trp 



cct gaa aac aaa agg tgg gat eta cat tac 
Pro Glu Asn Lys Arg Trp Asp Leu His Tyr 



gtg aca ttg ttg ata ttt ate ctt ctg ccg 
Val Thr Leu Leu lie Phe lie Leu Leu Pro 



ata teg gta age gtt gag tat att cag egg ttc aag acc ttc teg gcg 
lie Ser Val Ser Val Glu Tyr lie Gin Arg Phe Lys Thr Phe Ser Ala 



ggt gag ttt ctt age tea ate cag att ggc gtt aac atg tac gga age 
Gly Glu Phe Leu Ser Ser lie Gin lie Gly Val Asn Met Tyr Gly Ser 



age ttt aaa agt tat ttg acc atg atg gga tat aag aag aga cag gag 
Ser Phe Lys Ser Tyr Leu Thr Met Met Gly Tyr Lys Lys Arg Gin Glu 
100 105 110 

get aag atg tea ctg gat gag ctg gac aag aga tgc gtt tgt gat gag 
Ala Lys Met Ser Leu Asp Glu Leu Asp Lys Arg Cys Val Cys Asp Glu 
115 120 125 

gag agg acc att gta cat cga cat gtc gec ctg gga aac ttt tgc tat 
Glu Arg Thr lie Val His Arg His Val Ala Leu Gly Asn Phe Cys Tyr 
130 135 140 

att ttc tat cac att gcg tac act age ttt ttg att tea aac ttt ttg 
lie Phe Tyr His lie Ala Tyr Thr Ser Phe Leu lie Ser Asn Phe Leu 
145 150 155 160 

tea ttt ata atg aag aga ate cat gee tgg cgc atg tac ttt ccc tac 
Ser Phe He Met Lys Arg He His Ala Trp Arg Met Tyr Phe Pro Tyr 
165 170 175 

gtc gac ccc gaa aag caa ttt tac ate tct age ate gec gaa gtc att 
Val Asp Pro Glu Lys Gin Phe Tyr He Ser Ser He Ala Glu Val He 
180 185 190 

ctt agg gga tgg gec gtc ttc atg gat etc tgc acg gat gtg tgt cct 
Leu Arg Gly Trp Ala Val Phe Met Asp Leu Cys Thr Asp Val Cys Pro 
195 200 205 

ttg ate tec atg gta ata gca cga tgc cac ate acc ctt ctg aaa cag 
Leu He Ser Met Val He Ala Arg Cys His He Thr Leu Leu Lys Gin 
210 215 220 

cgc ctg cga aat eta cga teg gaa cca gga agg acg gaa gat gag tac 
Arg Leu Arg Asn Leu Arg Ser Glu Pro Gly Arg Thr Glu Asp Glu Tyr 
225 230 235 240 

ttg aag gag etc gee gac tgc gtt cga gat cac cgc ttg ata ttg ga 
Leu Lys Glu Leu Ala Asp Cys Val Arg Asp His Arg Leu He Leu 
245 250 255 



2 



<210> 2 
<211> 255 
<212> PRT 

<213> Drosophila melanogaster 



<400> 2 

Met Leu Ser Gin Phe Phe Pro His lie Lys Glu Lys Pro Leu Ser Glu 



Arg Val Lys Ser Arg Asp Ala Phe Val Tyr Leu Asp Arg Val Met Trp 

20 25 30 

Ser Phe Gly Trp Thr Val Pro Glu Asn Lys Arg Trp Asp Leu His Tyr 

35 40 45 

Lys Leu Trp Ser Thr Phe Val Thr Leu Leu lie Phe lie Leu Leu Pro 

50 55 60 

He Ser Val Ser Val Glu Tyr He Gin Arg Phe Lys Thr Phe Ser Ala 



Gly Glu Phe Leu Ser Ser He Gin He Gly Val Asn Met Tyr Gly Ser 

85 90 ' 95 

Ser Phe Lys Ser Tyr Leu Thr Met Met Gly Tyr Lys Lys Arg Gin Glu 
100 105 110 

Ala Lys Met Ser Leu Asp Glu Leu Asp Lys Arg Cys Val Cys Asp Glu 
115 120 125 

Glu Arg Thr He Val His Arg His Val Ala Leu Gly Asn Phe Cys Tyr 
130 135 140 

He Phe Tyr His He Ala Tyr Thr Ser Phe Leu He Ser Asn Phe Leu 
145 150 155 160 

Ser Phe He Met Lys Arg He His Ala Trp Arg Met Tyr Phe Pro Tyr 
165 170 175 

Val Asp Pro Glu Lys Gin Phe Tyr He Ser Ser He Ala Glu Val He 
180 185 190 

Leu Arg Gly Trp Ala Val Phe Met Asp Leu Cys Thr Asp Val Cys Pro 
195 200 205 

Leu He Ser Met Val He Ala Arg Cys His He Thr Leu Leu Lys Gin 
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210 215 220 

Arg Leu Arg Asn Leu Arg Ser Glu Pro Gly Arg Thr Glu Asp Glu Tyr 
225 230 235 240 

Leu Lys Glu Leu Ala Asp Cys Val Arg Asp His Arg Leu He Leu 
245 250 255 



<210> 3 
<211> 1140 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1137) 

<223> DOR 22C.1, a coding segment on BDGP Clone No. 
AC004716 

<400> 3 

atg act gac age ggg cag cct gec att gec gac cac ttt tat egg att 

Met Thr Asp Ser Gly Gin Pro Ala He Ala Asp His Phe Tyr Arg He 

15 10 15 

ccc cgc ate tec ggc etc att gtc ggc etc tgg ccg caa agg ata agg 
Pro Arg He Ser Gly Leu He Val Gly Leu Trp Pro Gin Arg He Arg 



ggc ggg ggc ggt cgt cct tgg cac gec cat ctg etc ttc gtg ttc gee 
Gly Gly Gly Gly Arg Pro Trp His Ala His Leu Leu Phe Val Phe Ala 
35 40 45 

ttc gee atg gtg gtg gtg ggt gcg gtg ggc gag gtg teg tac ggc tgt 
Phe Ala Met Val Val Val Gly Ala Val Gly Glu Val Ser Tyr Gly Cys 
50 55 60 

gtc cac ctg gac aac ctg gtg gtg gcg ctg gag gee ttc tgc ccc gga 
Val His Leu Asp Asn Leu Val Val Ala Leu Glu Ala Phe Cys Pro Gly 

65 70 75 80 

acc ace aag gcg gtc tgc gtt ttg aag ctg tgg gtc ttc ttc cgc tec 
Thr Thr Lys Ala Val Cys Val Leu Lys Leu Trp Val Phe Phe Arg Ser 
85 90 95 

aat cgc egg tgg gcg gag ttg gtc cag cgc ctg egg get att ttg etc 
Asn Arg Arg Trp Ala Glu Leu Val Gin Arg Leu Arg Ala He Leu Leu 



100 



105 



110 



age ctg ttg ttg etc age tct ggc acg gcg aca aat gec gec ttc acc 
Ser Leu Leu Leu Leu Ser Ser Gly Thr Ala Thr Asn Ala Ala Phe Thr 
115 120 125 

ttg caa ccg ctg att atg ggt etc tac cgc tgg att gtg cag ctg cca 
Leu Gin Pro Leu He Met Gly Leu Tyr Arg Trp He Val Gin Leu Pro 
130 135 140 

ggt caa acc gag ctg ccc ttt aat ate ata ctg ccc teg ttt gec gtg 
Gly Gin Thr Glu Leu Pro Phe Asn He He Leu Pro Ser Phe Ala Val 
145 150 155 160 

cag cca gga gtc ttt ccg etc acc tac gtg ctg ctg acc get tec ggt 
Gin Pro Gly Val Phe Pro Leu Thr Tyr Val Leu Leu Thr Ala Ser Gly 
165 170 175 

gee tgc acc gtt ttc gec ttc age ttc gtg gac gga ttc ttc att tgc 
Ala Cys Thr Val Phe Ala Phe Ser Phe Val Asp Gly Phe Phe He Cys 
180 185 190 

teg tgc etc tac ate tgc ggc get ttc egg ctg gtg cag cag gac att 
Ser Cys Leu Tyr He Cys Gly Ala Phe Arg Leu Val Gin Gin Asp lie 
195 200 205 

cgc agg ata ttt gec gat ttg cat ggc gtg gat gtg ttc acc gag gag 
Arg Arg He Phe Ala Asp Leu His Gly Val Asp Val Phe Thr Glu Glu 
210 215 220 

atg aac gcg gag gtg egg cac aga ctg gee caa gtt gtc gag egg cac 
Met Asn Ala Glu Val Arg His Arg Leu Ala Gin Val Val Glu Arg His 
225 230 235 240 

aat gcg att ate gat ttc tgc acg gac eta aca cgc cag ttc acc gtt 
Asn Ala He He Asp Phe Cys Thr Asp Leu Thr Arg Gin Phe Thr Val 
245 250 255 

ate gtt tta atg cat ttc ctg tec gee gec ttc gtc etc tgc teg acc 
He Val Leu Met His Phe Leu Ser Ala Ala Phe Val Leu Cys Ser Thr 
260 265 270 

ate ctg gac ate atg ttg aac acg teg teg ttg age ggc tta acc tac 
He Leu Asp He Met Leu Asn Thr Ser Ser Leu Ser Gly Leu Thr Tyr 
275 280 285 

ate tgc tat ate ate gcg gec eta acg cag eta ttc etc tac tgc ttc 
He Cys Tyr He He Ala Ala Leu Thr Gin Leu Phe Leu Tyr Cys Phe 



912 



5 



290 295 300 

gga ggc aat cac gtc age gag agt agt gcg get gtg gcg gac gtg ctg 960 
Gly Gly Asn His Val Ser Glu Ser Ser Ala Ala Val Ala Asp Val Leu 
305 310 315 320 

tac gac atg gag tgg tac aaa tgc gat gcg agg act agg aaa gtg att 1008 
Tyr Asp Met Glu Trp Tyr Lys Cys Asp Ala Arg Thr Arg Lys Val He 
325 330 335 

tta atg ata ttg cgc cgt teg cag egg gca aaa aca att gcg gtg ccg 1056 
Leu Met He Leu Arg Arg Ser Gin Arg Ala Lys Thr He Ala Val Pro 
340 345 350 

ttt ttt acg ccc tea ctg cca gca etc cga tct ata etc age aca gee 1104 
Phe Phe Thr Pro Ser Leu Pro Ala Leu Arg Ser He Leu Ser Thr Ala 
355 360 365 

gge tea tat ate acg ctg eta aag acg ttc ctg taa 1140 
Gly Ser Tyr He Thr Leu Leu Lys Thr Phe Leu 
370 375 



<210> 4 
<211> 379 
<212> PRT 

<213> Drosophila melanogaster 
<400> 4 

Met Thr Asp Ser Gly Gin Pro Ala He Ala Asp His Phe Tyr Arg He 
15 10 15 

Pro Arg He Ser Gly Leu He Val Gly Leu Trp Pro Gin Arg He Arg 



Gly Gly Gly Gly Arg Pro Trp His Ala His Leu Leu Phe Val Phe Ala 
35 40 45 

Phe Ala Met Val Val Val Gly Ala Val Gly Glu Val Ser Tyr Gly Cys 
50 55 60 

Val His Leu Asp Asn Leu Val Val Ala Leu Glu Ala Phe Cys Pro Gly 



Thr Thr Lys Ala Val Cys Val Leu Lys Leu Trp Val Phe Phe Arg Ser 
85 90 95 

Asn Arg Arg Trp Ala Glu Leu Val Gin Arg Leu Arg Ala He Leu Leu 



6 



100 



105 



110 



Ser Leu Leu Leu 
115 

Leu Gin Pro Leu 
130 

Gly Gin Thr Glu 
145 

Gin Pro Gly Val 



Ala Cys Thr Val 
180 

Ser Cys Leu Tyr 
195 

Arg Arg lie Phe 
210 

Met Asn Ala Glu 
225 

Asn Ala lie lie 



He Val Leu Met 
260 

lie Leu Asp He 
275 

He Cys Tyr He 
290 

Gly Gly Asn His 
305 

Tyr Asp Met Glu 



Leu Met He Leu 
340 

Phe Phe Thr Pro 



Leu Ser Ser Gly 

120 

He Met Gly Leu 
135 

Leu Pro Phe Asn 
150 

Phe Pro Leu Thr 
165 

Phe Ala Phe Ser 



He Cys Gly Ala 
200 

Ala Asp Leu His 
215 

Val Arg His Arg 
230 

Asp Phe Cys Thr 
245 

His Phe Leu Ser 



Met Leu Asn Thr 
280 

He Ala Ala Leu 
295 

Val Ser Glu Ser 
310 

Trp Tyr Lys Cys 
325 

Arg Arg Ser Gin 



Ser Leu Pro Ala 



Thr Ala Thr Asn 



Tyr Arg Trp He 
140 

He He Leu Pro 
155 

Tyr Val Leu Leu 
170 

Phe Val Asp Gly 

185 

Phe Arg Leu Val 



Gly Val Asp Val 
220 

Leu Ala Gin Val 
235 

Asp Leu Thr Arg 
250 

Ala Ala Phe Val 
265 

Ser Ser Leu Ser 



Thr Gin Leu Phe 
300 

Ser Ala Ala Val 
315 

Asp Ala Arg Thr 
330 

Arg Ala Lys Thr 
345 

Leu Arg Ser He 



Ala Ala Phe Thr 

125 

Val Gin Leu Pro 



Ser Phe Ala Val 
160 

Thr Ala Ser Gly 
175 

Phe Phe He Cys 
190 

Gin Gin Asp He 
205 

Phe Thr Glu Glu 



Val Glu Arg His 
240 

Gin Phe Thr Val 
255 

Leu Cys Ser Thr 
270 

Gly Leu Thr Tyr 
285 

Leu Tyr Cys Phe 



Ala Asp Val Leu 
320 

Arg Lys Val He 
335 

He Ala Val Pro 
350 

Leu Ser Thr Ala 



7 



Gly Ser Tyr lie Thr Leu Leu Lys Thr Phe Leu 
370 375 



<210> 5 
<211> 1140 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1137) 

<223> DOR23A.1, coding region of AF127925 
<400> 5 

atg aag etc age gaa acc eta aaa ate gac tat ttt cga gtc cag ttg 
Met Lys Leu Ser Glu Thr Leu Lys He Asp Tyr Phe Arg Val Gin Leu 
15 10 15 

aat gee tgg cga att tgt ggt gee ttg gat etc age gag ggt agg tac 
Asn Ala Trp Arg He Cys Gly Ala Leu Asp Leu Ser Glu Gly Arg Tyr 



tgg agt tgg teg atg eta ttg tgc ate ttg gtg tac ctg ccg aca ccc 
Trp Ser Trp Ser Met Leu Leu Cys He Leu Val Tyr Leu Pro Thr Pro 



atg eta ctg aga gga gta tac agt ttc gaa gat ccg gtg gaa aat aat 
Met Leu Leu Arg Gly Val Tyr Ser Phe Glu Asp Pro Val Glu Asn Asn 



ttc age ttg age ctg acg gtc act teg ctg tec aat etc atg aag ttc 
Phe Ser Leu Ser Leu Thr Val Thr Ser Leu Ser Asn Leu Met Lys Phe 



tgc atg tac gtg gee caa eta aca aag atg gtc gag gtc cag agt ctt 
Cys Met Tyr Val Ala Gin Leu Thr Lys Met Val Glu Val Gin Ser Leu 



att ggt cag ctg gat gec egg gtt tct ggc gag age cag tct gag cgt 
He Gly Gin Leu Asp Ala Arg Val Ser Gly Glu Ser Gin Ser Glu Arg 
100 105 110 



cat aga aat atg acc gag cac ctg eta agg atg tec aag ctg ttc cag 
His Arg Asn Met Thr Glu His Leu Leu Arg Met Ser Lys Leu Phe Gin 



115 120 125 

ate acc tac get gta gtc ttc ate att get gca gtt ccc ttc gtt ttc 
He Thr Tyr Ala Val Val Phe He He Ala Ala Val Pro Phe Val Phe 
130 135 140 

gaa act gag eta age tta ccc atg ccc atg tgg ttt ccc ttc gac tgg 
Glu Thr Glu Leu Ser Leu Pro Met Pro Met Trp Phe Pro Phe Asp Trp 
145 150 155 160 

aag aac teg atg gtg gec tac ate gga get ctg gtt ttc cag gag att 
Lys Asn Ser Met Val Ala Tyr He Gly Ala Leu Val Phe Gin Glu He 
165 170 175 

ggc tat gtc ttt caa att atg caa tgc ttt gca get gac teg ttt ccc 
Gly Tyr Val Phe Gin He Met Gin Cys Phe Ala Ala Asp Ser Phe Pro 
180 185 190 

ccg etc gta ctg tac ctg ate tec gag caa tgt caa ttg ctg ate ctg 
Pro Leu Val Leu Tyr Leu He Ser Glu Gin Cys Gin Leu Leu He Leu 
195 200 205 

aga ate tct gaa ate gga tat ggt tac aag act ctg gag gag aac gaa 
Arg He Ser Glu He Gly Tyr Gly Tyr Lys Thr Leu Glu Glu Asn Glu 
210 215 220 

cag gat ctg gtc aac tgc ate agg gat caa aac gcg ctg tat aga tta 
Gin Asp Leu Val Asn Cys He Arg Asp Gin Asn Ala Leu Tyr Arg Leu 
225 230 235 240 

etc gat gtg acc aag agt etc gtt teg tat ccc atg atg gtg cag ttt 
Leu Asp Val Thr Lys Ser Leu Val Ser Tyr Pro Met Met Val Gin Phe 
245 250 255 

atg gtt att ggc ate aac ate gec ate acc eta ttt gtc ctg ata ttt 
Met Val He Gly He Asn He Ala He Thr Leu Phe Val Leu He Phe 
260 265 270 

tac gtg gag acc ttg tac gat cgc ate tat tat ctt tgc ttt etc ttg 
Tyr Val Glu Thr Leu Tyr Asp Arg He Tyr Tyr Leu Cys Phe Leu Leu 
275 280 285 

ggc ate acc gtg cag aca tat cca ttg tgc tac tat gga acc atg gtg 
Gly He Thr Val Gin Thr Tyr Pro Leu Cys Tyr Tyr Gly Thr Met Val 
290 295 300 

cag gag agt ttt get gag ctt cac tat gcg gta ttc tgc age aac tgg 
Gin Glu Ser Phe Ala Glu Leu His Tyr Ala Val Phe Cys Ser Asn Trp 



9 



305 



310 



gtg gat caa agt gcc age tat cgt ggg cac atg etc ate ctg gcg gag 
Val Asp Gin Ser Ala Ser Tyr Arg Gly His Met Leu He Leu Ala Glu 
325 330 335 

cgc act aag egg atg cag ctt etc etc gcc ggc aac ctg gtg ccc ate 
Arg Thr Lys Arg Met Gin Leu Leu Leu Ala Gly Asn Leu Val Pro He 
340 345 350 



cac ctg age acc tac gtg gcc tgt tgg aag gga gcc tac tec ttc ttc 
His Leu Ser Thr Tyr Val Ala Cys Trp Lys Gly Ala Tyr Ser Phe Phe 
355 360 365 

acc ctg atg gcc gat cga gat ggc ctg ggt tct tag 
Thr Leu Met Ala Asp Arg Asp Gly Leu Gly Ser 
370 375 



<210> 6 
<211> 379 
<212> PRT 

<213> Drosophila melanogaster 
<400> 6 

Met Lys Leu Ser Glu Thr Leu Lys He Asp Tyr Phe Arg Val Gin Leu 
15 10 15 

Asn Ala Trp Arg He Cys Gly Ala Leu Asp Leu Ser Glu Gly Arg Tyr 



Trp Ser Trp Ser Met Leu Leu Cys He Leu Val Tyr Leu Pro Thr Pro 
35 40 45 

Met Leu Leu Arg Gly Val Tyr Ser Phe Glu Asp Pro Val Glu Asn Asn 
50 55 60 

Phe Ser Leu Ser Leu Thr Val Thr Ser Leu Ser Asn Leu Met Lys Phe 
65 70 75 80 

Cys Met Tyr Val Ala Gin Leu Thr Lys Met Val Glu Val Gin Ser Leu 
85 90 95 

He Gly Gin Leu Asp Ala Arg Val Ser Gly Glu Ser Gin Ser Glu Arg 
100 105 HO 

His Arg Asn Met Thr Glu His Leu Leu Arg Met Ser Lys Leu Phe Gin 
115 120 125 



10 



He Thr Tyr Ala Val Val Phe He He Ala Ala Val Pro Phe Val Phe 
130 135 140 



Glu Thr Glu Leu Ser Leu Pro Met Pro Met Trp Phe Pro Phe Asp Trp 
145 150 155 160 

Lys Asn Ser Met Val Ala Tyr He Gly Ala Leu Val Phe Gin Glu He 
165 170 175 

Gly Tyr Val Phe Gin He Met Gin Cys Phe Ala Ala Asp Ser Phe Pro 
180 185 190 

Pro Leu Val Leu Tyr Leu He Ser Glu Gin Cys Gin Leu Leu He Leu 
195 200 205 

Arg He Ser Glu He Gly Tyr Gly Tyr Lys Thr Leu Glu Glu Asn Glu 
210 215 220 

Gin Asp Leu Val Asn Cys He Arg Asp Gin Asn Ala Leu Tyr Arg Leu 
225 230 235 240 

Leu Asp Val Thr Lys Ser Leu Val Ser Tyr Pro Met Met Val Gin Phe 
245 250 255 

Met Val He Gly He Asn He Ala He Thr Leu Phe Val Leu He Phe 
260 265 270 

Tyr Val Glu Thr Leu Tyr Asp Arg He Tyr Tyr Leu Cys Phe Leu Leu 
275 280 285 

Gly He Thr Val Gin Thr Tyr Pro Leu Cys Tyr Tyr Gly Thr Met Val 
290 295 300 

Gin Glu Ser Phe Ala Glu Leu His Tyr Ala Val Phe Cys Ser Asn Trp 
305 310 315 320 

Val Asp Gin Ser Ala Ser Tyr Arg Gly His Met Leu He Leu Ala Glu 
325 330 335 

Arg Thr Lys Arg Met Gin Leu Leu Leu Ala Gly Asn Leu Val Pro He 
340 345 350 

His Leu Ser Thr Tyr Val Ala Cys Trp Lys Gly Ala Tyr Ser Phe Phe 
355 360 365 

Thr Leu Met Ala Asp Arg Asp Gly Leu Gly Ser 
370 375 



11 



<210> 7 
<211> 1143 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1140) 

<223> DOR 24D.1, a coding region on BDGP Clone No. 
AC004371 



<400> 7 

atg tta cct cga ttc ctg acc gcc tec tat cca atg gag cgc cat tat 
Met Leu Pro Arg Phe Leu Thr Ala Ser Tyr Pro Met Glu Arg His Tyr 
15 10 15 

ttc atg gtg cca aag ttt gca tta teg ctg att ggt ttt tat ccc gaa 
Phe Met Val Pro Lys Phe Ala Leu Ser Leu lie Gly Phe Tyr Pro Glu 
20 25 30 

cag aag cga acg gtt ttg gtg aaa ctt tgg agt ttc ttc aac ttt ttc 
Gin Lys Arg Thr Val Leu Val Lys Leu Trp Ser Phe Phe Asn Phe Phe 
35 40 45 

ate etc acc tac ggc tgt tat gca gag get tac tat ggc ata cac tat 
lie Leu Thr Tyr Gly Cys Tyr Ala Glu Ala Tyr Tyr Gly He His Tyr 
50 55 60 

ata ccg att aac ata gcc act gca ttg gat gcc ctt tgt cct gtg gcc 
He Pro He Asn He Ala Thr Ala Leu Asp Ala Leu Cys Pro Val Ala 
65 70 75 80 

tec age att ttg teg ctg gtg aaa atg gtc gcc att tgg tgg tat caa 
Ser Ser He Leu Ser Leu Val Lys Met Val Ala He Trp Trp Tyr Gin 
85 90 95 

gat gaa tta agg agt ttg ata gag egg agg ttc tat aca ctg gca acg 
Asp Glu Leu Arg Ser Leu He Glu Arg Arg Phe Tyr Thr Leu Ala Thr 
100 105 110 

caa -eta aca ttc ctg eta eta tgc tgt gga ttt tgc acc agt act tec 
Gin Leu Thr Phe Leu Leu Leu Cys Cys Gly Phe Cys Thr Ser Thr Ser 
115 120 125 

tat tec gtc aga cat ttg att gat aat ate ctg aga cgc acc cat ggc 
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Tyr Ser Val Arg His Leu He Asp Asn He Leu Arg Arg Thr His Gly 
130 135 140 

aag gac tgg ate tac gag act ccg ttc aag atg atg ttc ccc gat ctt 480 
Lys Asp Trp He Tyr Glu Thr Pro Phe Lys Met Met Phe Pro Asp Leu 
145 150 155 160 

etc ctg cgt ttg cca etc tat ccc ate acc tat ata etc gtg cat tgg 528 
Leu Leu Arg Leu Pro Leu Tyr Pro He Thr Tyr He Leu Val His Trp 
165 170 175 

cat ggc tac att act gtg gtt tgt ttt gtc ggc gcg gat ggt ttc ttc 57 6 
His Gly Tyr He Thr Val Val Cys Phe Val Gly Ala Asp Gly Phe Phe 
180 185 190 

ctg ggg ttc tgt ttg tac ttc act gtt ttg ctg etc tgt ctg cag gac 624 
Leu Gly Phe Cys Leu Tyr Phe Thr Val Leu Leu Leu Cys Leu Gin Asp 
195 200 205 

gat gtt tgt gat tta eta gag gtt gaa aac ate gag aag agt ccc tec 672 
Asp Val Cys Asp Leu Leu Glu Val Glu Asn lie Glu Lys Ser Pro Ser 
210 215 220 

gaa gcg gag gaa get cgc ata gtt egg gaa atg gaa aaa ctg gtg gac 720 
Glu Ala Glu Glu Ala Arg He Val Arg Glu Met Glu Lys Leu Val Asp 
225 230 235 240 

egg cat aac gag gtg gee gag ctg aca gaa aga ttg teg ggt gtt atg 7 68 
Arg His Asn Glu Val Ala Glu Leu Thr Glu Arg Leu Ser Gly Val Met 
245 250 255 

gtg gaa ata aca ctg gec cac ttt gtt act teg agt ttg ata ate gga 816 
Val Glu He Thr Leu Ala His Phe Val Thr Ser Ser Leu He He Gly 
260 265 270 

acc age gtg gtg gat att tta tta ttt tec ggc ctg gga ate att gtg 864 
Thr Ser Val Val Asp He Leu Leu Phe Ser Gly Leu Gly He He Val 
275 280 285 

tat gtg gtc tac act tgt gec gta ggt gtg gaa ata ttt eta tac tgt 912 
Tyr Val Val Tyr Thr Cys Ala Val Gly Val Glu lie Phe Leu Tyr Cys 
290 295 300 

tta gga gga tct cat att atg gaa gcg tgt tec aat eta gcg cgc tec 960 
Leu Gly Gly Ser His He Met Glu Ala Cys Ser Asn Leu Ala Arg Ser 
305 310 315 320 

aca ttt tec age cac tgg tat ggc cac agt gtt egg gtc caa aag atg 1008 

13 



Thr Phe Ser Ser His Trp Tyr Gly His Ser Val Arg Val Gin Lys Met 
325 330 335 



acc ctt ttg atg gta get cgt get caa cga gtt etc aca att aaa att 1056 
Thr Leu Leu Met Val Ala Arg Ala Gin Arg Val Leu Thr lie Lys lie 
340 345 350 

cct ttc ttt tec cca tea tta gag act eta act teg att ttg cgc ttc 1104 
Pro Phe Phe Ser Pro Ser Leu Glu Thr Leu Thr Ser lie Leu Arg Phe 
355 360 365 

act gga tct ctg att gec ctg gca aag teg gtt ata taa 1143 
Thr Gly Ser Leu He Ala Leu Ala Lys Ser Val He 
370 375 380 



<210> 8 
<211> 380 
<212> PRT 
<213> Drosophil; 

<400> 8 

Met Leu Pro Arg 
1 

Phe Met Val Pro 
20 

Gin Lys Arg Thr 
35 

He Leu Thr Tyr 
50 

He Pro He Asn 
65 

Ser Ser lie Leu 



Asp Glu Leu Arg 
100 

Gin Leu Thr Phe 
115 

Tyr Ser Val Arg 
130 



melanogaster 



Phe Leu Thr Ala 
5 

Lys Phe Ala Leu 



Val Leu Val Lys 
40 

Gly Cys Tyr Ala 
55 

He Ala Thr Ala 
70 

Ser Leu Val Lys 
85 

Ser Leu He Glu 



Leu Leu Leu Cys 
120 

His Leu He Asp 
135 



Ser Tyr Pro Met 
10 

Ser Leu He Gly 
25 

Leu Trp Ser Phe 



Glu Ala Tyr Tyr 
60 

Leu Asp Ala Leu 
75 

Met Val Ala He 



Arg Arg Phe Tyr 
105 

Cys Gly Phe Cys 



Asn He Leu Arg 
140 



Glu Arg His Tyr 
15 

Phe Tyr Pro Glu 
30 

Phe Asn Phe Phe 
45 

Gly He His Tyr 



Cys Pro Val Ala 
80 

Trp Trp Tyr Gin 
95 

Thr Leu Ala Thr 
110 

Thr Ser Thr Ser 
125 

Arg Thr His Gly 
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Lys Asp Trp lie Tyr Glu Thr Pro Phe Lys Met Met Phe Pro Asp Leu 
145 150 155 160 



Leu Leu Arg Leu Pro 
165 

His Gly Tyr lie Thr 
180 

Leu Gly Phe Cys Leu 
195 

Asp Val Cys Asp Leu 
210 

Glu Ala Glu Glu Ala 
225 

Arg His Asn Glu Val 
245 

Val Glu lie Thr Leu 
260 

Thr Ser Val Val Asp 
275 

Tyr Val Val Tyr Thr 
290 

Leu Gly Gly Ser His 

305 

Thr Phe Ser Ser His 
325 

Thr Leu Leu Met Val 
340 

Pro Phe Phe Ser Pro 
355 

Thr Gly Ser Leu lie 
370 



Leu Tyr Pro lie Thr Tyr 
170 

Val Val Cys Phe Val Gly 
185 

Tyr Phe Thr Val Leu Leu 
200 

Leu Glu Val Glu Asn He 
215 

Arg He Val Arg Glu Met 
230 235 

Ala Glu Leu Thr Glu Arg 
250 

Ala His Phe Val Thr Ser 
265 

He Leu Leu Phe Ser Gly 
280 

Cys Ala Val Gly Val Glu 
295 

He Met Glu Ala Cys Ser 
310 315 

Trp Tyr Gly His Ser Val 
330 

Ala Arg Ala Gin Arg Val 

345 

Ser Leu Glu Thr Leu Thr 
360 

Ala Leu Ala Lys Ser Val 
375 



He Leu Val His Trp 
175 

Ala Asp Gly Phe Phe 
190 

Leu Cys Leu Gin Asp 
205 

Glu Lys Ser Pro Ser 
220 

Glu Lys Leu Val Asp 
240 

Leu Ser Gly Val Met 
255 

Ser Leu He He Gly 
270 

Leu Gly He He Val 
285 

He Phe Leu Tyr Cys 
300 

Asn Leu Ala Arg Ser 
320 

Arg Val Gin Lys Met 
335 

Leu Thr lie Lys He 
350 

Ser He Leu Arg Phe 
365 

He 
380 
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<210> 9 

<211> 1212 

<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1209) 

<400> 9 

atg ttc gga cac ttt aag etc gtc tat ccg get cct ata teg gag ccc 48 

Met Phe Gly His Phe Lys Leu Val Tyr Pro Ala Pro lie Ser Glu Pro 

15 10 15 

ata cag tct agg gat teg aat gca tac atg atg gag acg ctg cga aat 96 
He Gin Ser Arg Asp Ser Asn Ala Tyr Met Met Glu Thr Leu Arg Asn 
20 25 30 

teg ggc ttg aat ttg aag aac gat ttc ggt ata ggc cgc aag att tgg 144 
Ser Gly Leu Asn Leu Lys Asn Asp Phe Gly He Gly Arg Lys He Trp 
35 40 45 

agg gtg ttt teg ttc acc tac aat atg gtg ata ctt ccc gta agt ttc 192 
Arg Val Phe Ser Phe Thr Tyr Asn Met Val He Leu Pro Val Ser Phe 
50 55 60 

cca ate aac tat gtg ata cat ctg gcg gag ttc ccg ccg gag ctg ctg 240 
Pro He Asn Tyr Val He His Leu Ala Glu Phe Pro Pro Glu Leu Leu 
65 70 75 80 

ctg caa tec ctg caa ctg tgc etc aac act tgg tgc ttc get ctg aag 288 
Leu Gin Ser Leu Gin Leu Cys Leu Asn Thr Trp Cys Phe Ala Leu Lys 
85 90 95 

ttc ttc act ctg ate gtc tat acg cac cgc ttg gag ctg gec aac aag 336 
Phe Phe Thr Leu He Val Tyr Thr His Arg Leu Glu Leu Ala Asn Lys 
100 105 110 

cac ttt gac gaa ttg gat aag tac tgc gtg aag ccg gcg gag aag cgc 384 
His Phe Asp Glu Leu Asp Lys Tyr Cys Val Lys Pro Ala Glu Lys Arg 
115 120 125 

aag gtt cgc gac atg gtg gec act att aca aga ctg tac ctg acc ttc 432 
Lys Val Arg Asp Met Val Ala Thr He Thr Arg Leu Tyr Leu Thr Phe 
130 135 140 

gtc gtg gtc tac gtc etc tac gec acc tec acg eta ctg gac gga eta 480 
Val Val Val Tyr Val Leu Tyr Ala Thr Ser Thr Leu Leu Asp Gly Leu 
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ctg cac cac cgt gtt ccc tac aat acg tac tat ccg ttc ata aac tgg 
Leu His His Arg Val Pro Tyr Asn Thr Tyr Tyr Pro Phe lie Asn Trp 
165 170 175 

cga gtc gat egg acc cag atg tac ate cag agt ttt ctg gag tac ttc 
Arg Val Asp Arg Thr Gin Met Tyr lie Gin Ser Phe Leu Glu Tyr Phe 
180 185 190 

acc gtg ggt tat gec ata tat gtg gec acc gec acc gat tec tac cct 
Thr Val Gly Tyr Ala lie Tyr Val Ala Thr Ala Thr Asp Ser Tyr Pro 
195 200 205 

gtg att tac gtg gca gec ctg cga act cat att etc ttg etc aag gac 
Val lie Tyr Val Ala Ala Leu Arg Thr His lie Leu Leu Leu Lys Asp 
210 215 220 

cgt ate att tac ttg ggc gat ccc age aac gag ggt age age gac ccg 
Arg lie lie Tyr Leu Gly Asp Pro Ser Asn Glu Gly Ser Ser Asp Pro 
225 230 235 240 

age tac atg ttt aaa teg ttg gtg gat tgt ate aag gca cac aga acc 
Ser Tyr Met Phe Lys Ser Leu Val Asp Cys lie Lys Ala His Arg Thr 
245 250 255 

atg eta aat ttt tgt gat gee att caa cca ate ate tct ggc acg ata 
Met Leu Asn Phe Cys Asp Ala lie Gin Pro lie lie Ser Gly Thr lie 
260 265 270 

ttt gec caa ttc ate ata tgc gga teg ate ctg ggc ata att atg ate 
Phe Ala Gin Phe He He Cys Gly Ser He Leu Gly, He He Met He 
275 280 285 

aac atg gta ttg ttc get gat caa teg acc cga ttc ggc ata gtc ate 
Asn Met Val Leu Phe Ala Asp Gin Ser Thr Arg Phe Gly He Val He 
290 295 300 

tac gtt atg gee gtc ctt ctg cag act ttt ccg ctt tgc ttc tac tgc 
Tyr Val Met Ala Val Leu Leu Gin Thr Phe Pro Leu Cys Phe Tyr Cys 
305 310 315 320 

aac gee ate gtg gac gac tgc aaa gaa ctg gee cac gca ctt ttc cat 
Asn Ala He Val Asp Asp Cys Lys Glu Leu Ala His Ala Leu Phe His 
325 330 335 

tec gec tgg tgg gtg cag gac aag cga tac cag egg act gtc ate cag 
Ser Ala Trp Trp Val Gin Asp Lys Arg Tyr Gin Arg Thr Val He Gin 
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340 



345 



350 



ttc ctg cag aaa ctg cag cag ccc atg acc ttc acc gcc atg aac ata 1104 
Phe Leu Gin Lys Leu Gin Gin Pro Met Thr Phe Thr Ala Met Asn He 
355 360 365 

ttt aac att aat ttg gcc act aac ate aat gta gcc aag ttc gcc ttc 1152 
Phe Asn He Asn Leu Ala Thr Asn He Asn Val Ala Lys Phe Ala Phe 
370 375 380 

acc gtg tac gcc ate gcg age ggt atg aac ctg gac caa aag tta age 1200 
Thr Val Tyr Ala He Ala Ser Gly Met Asn Leu Asp Gin Lys Leu Ser 
385 390 395 400 



att aag gaa tag 
He Lys Glu 



<210> 10 
<211> 403 
<212> PRT 

<213> Drosophila melanogaster 
<400> 10 

Met Phe Gly His Phe Lys Leu Val Tyr Pro Ala Pro He Ser Glu Pro 



He Gin Ser Arg Asp Ser Asn Ala Tyr Met Met Glu Thr Leu Arg Asn 



Ser Gly Leu Asn Leu Lys Asn Asp Phe Gly He Gly Arg Lys He Trp 



Arg Val Phe Ser Phe Thr Tyr Asn Met Val He Leu Pro Val Ser Phe 



Pro lie Asn Tyr Val He His Leu Ala Glu Phe Pro Pro Glu Leu Leu 



Leu Gin Ser Leu Gin Leu Cys Leu Asn Thr Trp Cys Phe Ala Leu Lys 



Phe Phe Thr Leu He Val Tyr Thr His Arg Leu Glu Leu Ala Asn Lys 
100 105 110 

His Phe Asp Glu Leu Asp Lys Tyr Cys Val Lys Pro Ala Glu Lys Arg 
115 120 125 
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Lys Val Arg Asp Met Val Ala Thr lie Thr Arg Leu Tyr Leu Thr Phe 
130 135 140 



Val Val Val Tyr Val Leu Tyr Ala Thr Ser Thr Leu Leu Asp Gly Leu 
145 150 155 160 

Leu His His Arg Val Pro Tyr Asn Thr Tyr Tyr Pro Phe lie Asn Trp 
165 170 175 

Arg Val Asp Arg Thr Gin Met Tyr lie Gin Ser Phe Leu Glu Tyr Phe 
180 185 190 

Thr Val Gly Tyr Ala He Tyr Val Ala Thr Ala Thr Asp Ser Tyr Pro 
195 200 205 

Val He Tyr Val Ala Ala Leu Arg Thr His He Leu Leu Leu Lys Asp 
210 215 220 

Arg lie lie Tyr Leu Gly Asp Pro Ser Asn Glu Gly Ser Ser Asp Pro 
225 230 235 240 

Ser Tyr Met Phe Lys Ser Leu Val Asp Cys He Lys Ala His Arg Thr 
245 250 255 

Met Leu Asn Phe Cys Asp Ala He Gin Pro He He Ser Gly Thr He 
260 265 270 

Phe Ala Gin Phe He He Cys Gly Ser He Leu Gly He He Met He 
275 280 285 

Asn Met Val Leu Phe Ala Asp Gin Ser Thr Arg Phe Gly He Val He 
290 295 300 

Tyr Val Met Ala Val Leu Leu Gin Thr Phe Pro Leu Cys Phe Tyr Cys 
305 310 315 320 

Asn Ala He Val Asp Asp Cys Lys Glu Leu Ala His Ala Leu Phe His 
325 330 335 

Ser Ala Trp Trp Val Gin Asp Lys Arg Tyr Gin Arg Thr Val lie Gin 
340 345 350 

Phe Leu Gin Lys Leu Gin Gin Pro Met Thr Phe Thr Ala Met Asn He 
355 360 365 

Phe Asn He Asn Leu Ala Thr Asn He Asn Val Ala Lys Phe Ala Phe 
370 375 380 
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Thr Val Tyr Ala lie Ala Ser Gly Met Asn Leu Asp Gin Lys Leu Ser 
385 390 395 400 



lie Lys Glu 



<210> 11 
<211> 1137 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1134) 

<223> DOR 33B.1, a coding region on BDGP Clone No. 
AC006240 



<400> 11 

atg gat tea aga agg aaa gtc cga 
Met Asp Ser Arg Arg Lys Val Arg 
1 5 

tgg ctt tac tgg cga ctt ctg gga 
Trp Leu Tyr Trp Arg Leu Leu Gly 

20 



agt gaa aat ctt tac aaa acc tat 48 
Ser Glu Asn Leu Tyr Lys Thr Tyr 
10 15 

gtc gag ggc gat tat cct ttt cga 96 
Val Glu Gly Asp Tyr Pro Phe Arg 
25 30 



egg eta gtg gat ttt aca ate acg tct ttc att acg att tta ttt ccc 144 
Arg Leu Val Asp Phe Thr lie Thr Ser Phe lie Thr He Leu Phe Pro 
35 40 45 



gtg cat ctt ata ctg gga atg tat aaa aag ccc cag att caa gtc ttc 192 
Val His Leu He Leu Gly Met Tyr Lys Lys Pro Gin He Gin Val Phe 
50 55 60 



agg agt ctg cat ttc aca teg gaa tgc ctt ttc tgc age tat aag ttt 240 
Arg Ser Leu His Phe Thr Ser Glu Cys Leu Phe Cys Ser Tyr Lys Phe 
65 70 75 80 



ttc tgt ttt cgt tgg aaa ctt aaa gaa ata aag acc ate gaa gga ttg 288 
Phe Cys Phe Arg Trp Lys Leu Lys Glu He Lys Thr He Glu Gly Leu 
85 90 95 



etc cag gat etc gat agt cga gtt gaa agt gaa gaa gaa cgc aac tac 336 
Leu Gin Asp Leu Asp Ser Arg Val Glu Ser Glu Glu Glu Arg Asn Tyr 
100 105 110 



20 



ttt aat caa aat cca agt cgt gtg get cga atg ctt teg aaa agt tac 
Phe Asn Gin Asn Pro Ser Arg Val Ala Arg Met Leu Ser Lys Ser Tyr 



ttg gta get get ata teg gee ata ate act gca act gta get ggt tta 
Leu Val Ala Ala lie Ser Ala lie lie Thr Ala Thr Val Ala Gly Leu 
130 135 140 

ttt agt act ggt cga aat tta atg tat ctg ggt tgg ttt ccc tac gat 
Phe Ser Thr Gly Arg Asn Leu Met Tyr Leu Gly Trp Phe Pro Tyr Asp 
145 150 155 160 

ttt caa gca ace gee gca ate tat tgg att agt ttt tec tat cag gcg 
Phe Gin Ala Thr Ala Ala lie Tyr Trp lie Ser Phe Ser Tyr Gin Ala 
165 170 175 

att ggc tct agt ctg ttg att ctg gaa aat ctg gee aac gat tea tat 
lie Gly Ser Ser Leu Leu lie Leu Glu Asn Leu Ala Asn Asp Ser Tyr 
180 185 190 

ccg ccg att aca ttt tgt gtg gtc tct gga cat gtg aga eta ttg ata 
Pro Pro lie Thr Phe Cys Val Val Ser Gly His Val Arg Leu Leu lie 
195 200 205 

atg cgt tta agt cga att ggt cac gat gta aaa tta tea agt teg gaa 
Met Arg Leu Ser Arg lie Gly His Asp Val Lys Leu Ser Ser Ser Glu 
210 215 220 

aat acc aga aaa etc ate gaa ggt ate cag gat cac agg aaa eta atg 
Asn Thr Arg Lys Leu lie Glu Gly He Gin Asp His Arg Lys Leu Met 
225 230 235 240 

aag ata ata cgc eta ctt cgc age act tta cat ctt age caa ctg ggc 
Lys He He Arg Leu Leu Arg Ser Thr Leu His Leu Ser Gin Leu Gly 
245 250 255 

cag ttc ctt tct agt gga ate aac att tec ata aca etc ate aac ate 
Gin Phe Leu Ser Ser Gly He Asn He Ser He Thr Leu He Asn He 
260 265 270 

ctg ttc ttt gcg gaa aac aac ttt gca atg ctt tat tat gcg gtg ttc 
Leu Phe Phe Ala Glu Asn Asn Phe Ala Met Leu Tyr Tyr Ala Val Phe 
275 280 285 

ttt get gca atg tta ata gaa eta ttt cca agt tgt tac tat gga att 
Phe Ala Ala Met Leu He Glu Leu Phe Pro Ser Cys Tyr Tyr Gly He 
290 295 300 



21 



ctg atg 
Leu Met 
305 



aca atg gag ttt gat aag eta cca tat gec ate ttc tec age 960 
Thr Met Glu Phe Asp Lys Leu Pro Tyr Ala lie Phe Ser Ser 
310 315 320 



aac tgg ctt aaa atg gat aaa aga tac aat cga tec ttg ata att ctg 1008 

Asn Trp Leu Lys Met Asp Lys Arg Tyr Asn Arg Ser Leu lie lie Leu 

325 330 335 

• 

atg caa eta aca ctg gtt cca gtg aat ata aaa gca ggt ggt att gtt 1056 

Met Gin Leu Thr Leu Val Pro Val Asn lie Lys Ala Gly Gly lie Val 

340 345 350 

ggc ate gat atg agt gca ttt ttt gec aca gtt egg atg gca tat tec 1104 
Gly He Asp Met Ser Ala Phe Phe Ala Thr Val Arg Met Ala Tyr Ser 
355 360 365 

ttt tac act tta gec ttg tea ttt cga gta tag 1137 
Phe Tyr Thr Leu Ala Leu Ser Phe Arg Val 
370 375 



<210> 12 
<211> 378 
<212> PRT 

<213> Drosophila melanogaster 
<400> 12 

Met Asp Ser Arg Arg Lys Val Arg Ser Glu Asn Leu Tyr Lys Thr Tyr 
15 10 15 

Trp Leu Tyr Trp Arg Leu Leu Gly Val Glu Gly Asp Tyr Pro Phe Arg 
20 25 30 

Arg Leu Val Asp Phe Thr lie Thr Ser Phe He Thr lie Leu Phe Pro 



Val His Leu He Leu Gly Met Tyr Lys Lys Pro Gin He Gin Val Phe 
50 55 60 

Arg Ser Leu His Phe Thr Ser Glu Cys Leu Phe Cys Ser Tyr Lys Phe 
65 70 75 80 

Phe Cys Phe Arg Trp Lys Leu Lys Glu He Lys Thr He Glu Gly Leu 
85 90 95 

Leu Gin Asp Leu Asp Ser Arg Val Glu Ser Glu Glu Glu Arg Asn Tyr 
100 105 110 
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Phe Asn Gin Asn Pro Ser Arg Val Ala Arg Met Leu Ser Lys Ser Tyr 
115 120 125 



Leu Val Ala Ala lie Ser Ala lie He Thr Ala Thr Val Ala Gly Leu 
130 135 140 

Phe Ser Thr Gly Arg Asn Leu Met Tyr Leu Gly Trp Phe Pro Tyr Asp 
145 150 155 160 

Phe Gin Ala Thr Ala Ala He Tyr Trp He Ser Phe Ser Tyr Gin Ala 
165 170 175 

lie Gly Ser Ser Leu Leu He Leu Glu Asn Leu Ala Asn Asp Ser Tyr 
180 185 190 

Pro Pro He Thr Phe Cys Val Val Ser Gly His Val Arg Leu Leu He 
195 200 205 

Met Arg Leu Ser Arg He Gly His Asp Val Lys Leu Ser Ser Ser Glu 
210 215 220 

Asn Thr Arg Lys Leu He Glu Gly He Gin Asp His Arg Lys Leu Met 
225 230 235 240 

Lys He He Arg Leu Leu Arg Ser Thr Leu His Leu Ser Gin Leu Gly 
245 250 255 

Gin Phe Leu Ser Ser Gly He Asn He Ser He Thr Leu He Asn He 
260 265 270 

Leu Phe Phe Ala Glu Asn Asn Phe Ala Met Leu Tyr Tyr Ala Val Phe 
275 280 285 

Phe Ala Ala Met Leu lie Glu Leu Phe Pro Ser Cys Tyr Tyr Gly He 
290 295 300 

Leu Met Thr Met Glu Phe Asp Lys Leu Pro Tyr Ala He Phe Ser Ser 
305 310 315 320 

Asn Trp Leu Lys Met Asp Lys Arg Tyr Asn Arg Ser Leu lie lie Leu 
325 330 335 

Met Gin Leu Thr Leu Val Pro Val Asn He Lys Ala Gly Gly lie Val 
340 345 350 

Gly He Asp Met Ser Ala Phe Phe Ala Thr Val Arg Met Ala Tyr Ser 
355 360 365 
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Phe Tyr Thr Leu Ala Leu Ser Phe Arg Val 
370 375 



<210> 13 
<211> 1140 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1137) 

<223> DOR 33B.2, a coding region on BDGP Clone No. 
AC006240 



<400> 13 

atg gac tta aaa ccg cga gtc att cga agt gaa gat ate tac aga acc 48 
Met Asp Leu Lys Pro Arg Val lie Arg Ser Glu Asp lie Tyr Arg Thr 
15 10 15 



tat tgg tta tat tgg cat ctt ttg ggc ctg gaa age aat ttc ttt ctg 96 
Tyr Trp Leu Tyr Trp His Leu Leu Gly Leu Glu Ser Asn Phe Phe Leu 
20 25 30 



aat cgc ttg ttg gat ttg gtg att aca att ttc gta acc att tgg tat 144 
Asn Arg Leu Leu Asp Leu Val lie Thr lie Phe Val Thr lie Trp Tyr 
35 40 45 



cca att cac ctg att ctg gga ctg ttt atg gaa aga tct ttg ggg gat 192 
Pro lie His Leu lie Leu Gly Leu Phe Met Glu Arg Ser Leu Gly Asp 
50 55 60 



gtc tgc aag ggt eta cca att acg gca gca tgc ttt ttc gec age ttt 240 
Val Cys Lys Gly Leu Pro lie Thr Ala Ala Cys Phe Phe Ala Ser Phe 
65 70 75 80 



aaa ttt att tgt ttt cgc ttc aag eta tct gaa att aaa gaa ate gaa 288 
Lys Phe lie Cys Phe Arg Phe Lys Leu Ser Glu lie Lys Glu lie Glu 
85 90 95 



ata tta ttt aaa gag ctg gat cag cga get tta agt cga gag gaa tgc 336 
lie Leu Phe Lys Glu Leu Asp Gin Arg Ala Leu Ser Arg Glu Glu Cys 
100 105 110 



gag ttt ttc aat caa aat acg aga cgt gag gcg aat ttc att tgg aaa 
Glu Phe Phe Asn Gin Asn Thr Arg Arg Glu Ala Asn Phe lie Trp Lys 
115 120 125 
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agt ttc att gtg gcc tat gga ctg teg aat ate teg get att gca tea 
Ser Phe He Val Ala Tyr Gly Leu Ser Asn He Ser Ala He Ala Ser 
130 135 140 

gtt ctt ttc ggc ggt gga cat aag eta tta tat ccc gcc tgg ttt cca 
Val Leu Phe Gly Gly Gly His Lys Leu Leu Tyr Pro Ala Trp Phe Pro 
145 150 155 160 

tac gat gtg cag gcc acg gaa eta ata ttt tgg eta agt gta aca tac 
Tyr Asp Val Gin Ala Thr Glu Leu He Phe Trp Leu Ser Val Thr Tyr 
165 170 175 

caa att gcc gga gta agt ttg gcc ata ctt cag aat ttg gcc aat gat 
Gin He Ala Gly Val Ser Leu Ala He Leu Gin Asn Leu Ala Asn Asp 
180 185 190 

tec tat cca ccg atg aca ttt tgc gtg gtt gcc ggt cat gta aga ctt 
Ser Tyr Pro Pro Met Thr Phe Cys Val Val Ala Gly His Val Arg Leu 
195 200 205 

ttg gcg atg cgc ttg agt aga att ggc caa ggt cca gag gaa aca ata 
Leu Ala Met Arg Leu Ser Arg He Gly Gin Gly Pro Glu Glu Thr He 
210 215 220 

tac tta ace gga aag caa tta ate gaa age ate gag gat cac cga aaa 
Tyr Leu Thr Gly Lys Gin Leu He Glu Ser He Glu Asp His Arg Lys 
225 230 235 240 

eta atg aaa ata gtg gaa tta ctg cgc age ace atg aat att teg cag 
Leu Met Lys He Val Glu Leu Leu Arg Ser Thr Met Asn He Ser Gin 
245 250 255 

etc ggc cag ttt att tea agt ggt gtt aat att tec ata aca eta gtc 
Leu Gly Gin Phe He Ser Ser Gly Val Asn He Ser lie Thr Leu Val 
260 265 270 

aac att etc ttc ttt gcg gat aat aat ttc get ata acc tac tac gga 
Asn He Leu Phe Phe Ala Asp Asn Asn Phe Ala He Thr Tyr Tyr Gly 
275 280 285 

gtg tac ttc eta teg atg gtg ttg gaa tta ttc ccg tgc tgc tat tac 
Val Tyr Phe Leu Ser Met Val Leu Glu Leu Phe Pro Cys Cys Tyr Tyr 
290 295 300 

ggc acc ctg ata tec gtg gag atg aac cag ctg acc tat gcg att tac 
Gly Thr Leu He Ser Val Glu Met Asn Gin Leu Thr Tyr Ala He Tyr 
305 310 315 320 
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tea agt aac tgg atg agt atg aat egg age tac age cgc ate eta ctg 
Ser Ser Asn Trp Met Ser Met Asn Arg Ser Tyr Ser Arg lie Leu Leu 
325 330 335 



1008 



ate ttc atg caa etc ace ctg gcg gaa gtg cag ate aag gee ggt ggg 1056 
lie Phe Met Gin Leu Thr Leu Ala Glu Val Gin lie Lys Ala Gly Gly 
340 345 350 

atg att ggc ate gga atg aac gec ttc ttt gee acc gtg cga ttg gec 1104 
Met lie Gly lie Gly Met Asn Ala Phe Phe Ala Thr Val Arg Leu Ala 
355 360 365 

tac tec ttc ttc act ttg gee atg teg ctg cgt taa 1140 
Tyr Ser Phe Phe Thr Leu Ala Met Ser Leu Arg 
370 375 



<210> 14 
<211> 379 
<212> PRT 

<213> Drosophila melanogaster 
<400> 14 

Met Asp Leu Lys Pro Arg Val lie Arg Ser Glu Asp lie Tyr Arg Thr 



Tyr Trp Leu Tyr Trp His Leu Leu Gly Leu Glu Ser Asn Phe Phe Leu 
20 25 30 

Asn Arg Leu Leu Asp Leu Val lie Thr lie Phe Val Thr lie Trp Tyr 
35 40 45 

Pro lie His Leu lie Leu Gly Leu Phe Met Glu Arg Ser Leu Gly Asp 
50 55 60 

Val Cys Lys Gly Leu Pro lie Thr Ala Ala Cys Phe Phe Ala Ser Phe 



Lys Phe lie Cys Phe Arg Phe Lys Leu Ser Glu lie Lys Glu lie Glu 

85 90 95 

lie Leu Phe Lys Glu Leu Asp Gin Arg Ala Leu Ser Arg Glu Glu Cys 

100 105 110 

Glu Phe Phe Asn Gin Asn Thr Arg Arg Glu Ala Asn Phe lie Trp Lys 
115 120 125 
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Ser Phe lie Val Ala 
130 

Val Leu Phe Gly Gly 
145 

Tyr Asp Val Gin Ala 
165 

Gin He Ala Gly Val 
180 

Ser Tyr Pro Pro Met 
195 

Leu Ala Met Arg Leu 
210 

Tyr Leu Thr Gly Lys 
225 

Leu Met Lys He Val 
245 

Leu Gly Gin Phe He 

260 

Asn He Leu Phe Phe 
275 

Val Tyr Phe Leu Ser 
290 

Gly Thr Leu He Ser 
305 

Ser Ser Asn Trp Met 
325 

He Phe Met Gin Leu 
340 

Met He Gly He Gly 
355 

Tyr Ser Phe Phe Thr 
370 



Tyr Gly Leu Ser Asn He 
135 

Gly His Lys Leu Leu Tyr 
150 155 

Thr Glu Leu He Phe Trp 
170 

Ser Leu Ala He Leu Gin 
185 

Thr Phe Cys Val Val Ala 
200 

Ser Arg He Gly Gin Gly 
215 

Gin Leu He Glu Ser He 
230 235 

Glu Leu Leu Arg Ser Thr 
250 

Ser Ser Gly Val Asn He 
265 

Ala Asp Asn Asn Phe Ala 
280 

Met Val Leu Glu Leu Phe 
295 

Val Glu Met Asn Gin Leu 
310 315 

Ser Met Asn Arg Ser Tyr 
330 

Thr Leu Ala Glu Val Gin 
345 

Met Asn Ala Phe Phe Ala 
360 

Leu Ala Met Ser Leu Arg 
375 



Ser Ala He Ala Ser 

140 

Pro Ala Trp Phe Pro 
160 

Leu Ser Val Thr Tyr 
175 

Asn Leu Ala Asn Asp 
190 

Gly His Val Arg Leu 
205 

Pro Glu Glu Thr He 
220 

Glu Asp His Arg Lys 
240 

Met Asn He Ser Gin 
255 

Ser lie Thr Leu Val 
270 

He Thr Tyr Tyr Gly 
285 

Pro Cys Cys Tyr Tyr 
300 

Thr Tyr Ala He Tyr 
320 

Ser Arg He Leu Leu 
335 

He Lys Ala Gly Gly 
350 

Thr Val Arg Leu Ala 
365 
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<210> 15 
<211> 1155 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1152) 

<223> DOR 33B3.3, a coding region on BDGP Clone No. 
AC006240 



atg gtc att ate gac agt ctt agt ttt tat cgt cca ttc tgg ate tgc 
Met Val lie lie Asp Ser Leu Ser Phe Tyr Arg Pro Phe Trp lie Cys 



atg cga ttg ctg gta ccg act ttc ttc aag gat tec tea cgt cct gtc 
Met Arg Leu Leu Val Pro Thr Phe Phe Lys Asp Ser Ser Arg Pro Val 
20 25 30 

cag ctg tac gtg gtg ttg ctg cac ate ctg gtc ace ttg tgg ttt cca 
Gin Leu Tyr Val Val Leu Leu His lie Leu Val Thr Leu Trp Phe Pro 
35 40 45 

ctg cat ctg ctg ctg cat ctt ctg eta ctt cca tct ace get gag ttc 
Leu His Leu Leu Leu His Leu Leu Leu Leu Pro Ser Thr Ala Glu Phe 



ttt aag aac ctg acc atg tct ctg act tgt gtg gee tgc agt ctg aag 
Phe Lys Asn Leu Thr Met Ser Leu Thr Cys Val Ala Cys Ser Leu Lys 
65 70 75 80 

cat gtg gee cac ttg tat cac ttg ccg cag att gtg gaa ate gaa tea 
His Val Ala His Leu Tyr His Leu Pro Gin lie Val Glu lie Glu Ser 
85 90 95 

ctg ate gag caa tta gac aca ttt att gec age gaa cag gag cat cgt 
Leu lie Glu Gin Leu Asp Thr Phe lie Ala Ser Glu Gin Glu His Arg 
100 105 110 

tac tat egg gat cac gta cat tgc cat get agg cgc ttt aca aga tgt 
Tyr Tyr Arg Asp His Val His Cys His Ala Arg Arg Phe Thr Arg Cys 
115 120 125 

etc tat att age ttt ggc atg ate tat gcg ctt ttc ctg ttc ggc gtc 
Leu Tyr lie Ser Phe Gly Met lie Tyr Ala Leu Phe Leu Phe Gly Val 
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130 



135 



140 



ttc gtt cag gtt att age gga aat tgg gaa ctt etc tat cca gec tat 
Phe Val Gin Val lie Ser Gly Asn Trp Glu Leu Leu Tyr Pro Ala Tyr 
145 150 155 160 

ttc cca ttc gac ttg gag age aat cgc ttt etc ggc gca gta gec ttg 
Phe Pro Phe Asp Leu Glu Ser Asn Arg Phe Leu Gly Ala Val Ala Leu 
165 170 175 

ggc tat cag gta ttc age atg tta gtt gaa ggc ttc cag ggg ctg ggc 
Gly Tyr Gin Val Phe Ser Met Leu Val Glu Gly Phe Gin Gly Leu Gly 
180 185 190 

aac gat acc tat ace cca ctg acc eta tgc ctt ctg gec gga cat gtc 
Asn Asp Thr Tyr Thr Pro Leu Thr Leu Cys Leu Leu Ala Gly His Val 
195 200 205 

cat ttg tgg tec ata cga atg ggt caa ctg gga tac ttc gat gac gag 
His Leu Trp Ser lie Arg Met Gly Gin Leu Gly Tyr Phe Asp Asp Glu 
210 215 220 

acg gtg gtg aat cat cag cgt ttg ctg gat tac att gag cag cat aaa 
Thr Val Val Asn His Gin Arg Leu Leu Asp Tyr lie Glu Gin His Lys 
225 230 235 240 

etc ttg gtg cga ttc cac aac ctg gtg age egg acc ate age gaa gtg 
Leu Leu Val Arg Phe His Asn Leu Val Ser Arg Thr lie Ser Glu Val 
245 250 255 

caa ctg gtg cag ctg ggc gga tgt gga gec act ctg tgc ate att gtc 
Gin Leu Val Gin Leu Gly Gly Cys Gly Ala Thr Leu Cys lie lie Val 
260 265 270 

tec tac atg etc ttc ttt gtg ggc gac aca ate teg ctg gtc tac tac 
Ser Tyr Met Leu Phe Phe Val Gly Asp Thr lie Ser Leu Val Tyr Tyr 
275 280 285 

ttg gtg ttc ttt gga gtg gtc tgc gtg cag etc ttt ccc age tgc tat 
Leu Val Phe Phe Gly Val Val Cys Val Gin Leu Phe Pro Ser Cys Tyr 
290 295 300 

ttt gee age gaa gta gee gag gag ttg gaa egg ctg cca tat gcg ate 
Phe Ala Ser Glu Val Ala Glu Glu Leu Glu Arg Leu Pro Tyr Ala He 
305 310 315 320 

ttc tec age aga tgg tac gat caa teg egg gat cat cga ttc gat ttg 
Phe Ser Ser Arg Trp Tyr Asp Gin Ser Arg Asp His Arg Phe Asp Leu 
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325 



330 



335 



etc ate ttt aca caa tta 
Leu lie Phe Thr Gin Leu 
340 

gca gga ggt ctt ate gag 
Ala Gly Gly Leu lie Glu 
355 

aag atg gec tat tec ctt 
Lys Met Ala Tyr Ser Leu 
370 

tag 



aca ctg gga aac egg ggg 
Thr Leu Gly Asn Arg Gly 
345 

ctg aat ttg aat gec ttt 
Leu Asn Leu Asn Ala Phe 
360 

ttt gca gtt gtg gtg egg 
Phe Ala Val Val Val Arg 
375 380 



tgg ate ate aag 1056 
Trp lie lie Lys 
350 

ttc gec acc ctg 1104 

Phe Ala Thr Leu 

365 

gca aag ggt ata 1152 
Ala Lys Gly lie 



1155 



<210> 16 
<211> 384 
<212> PRT 

<213> Drosophila melanogaster 
<400> 16 

Met Val lie lie Asp Ser Leu Ser Phe Tyr Arg Pro Phe Trp lie Cys 



Met Arg Leu Leu Val Pro Thr Phe Phe Lys Asp Ser Ser Arg Pro Val 
20 25 30 

Gin Leu Tyr Val Val Leu Leu His lie Leu Val Thr Leu Trp Phe Pro 
35 40 45 

Leu His Leu Leu Leu His Leu Leu Leu Leu Pro Ser Thr Ala Glu Phe 
50 55 60 

Phe Lys Asn Leu Thr Met Ser Leu Thr Cys Val Ala Cys Ser Leu Lys 



His Val Ala His Leu Tyr His Leu Pro Gin lie Val Glu lie Glu Ser 
85 90 95 

Leu lie Glu Gin Leu Asp Thr Phe lie Ala Ser Glu Gin Glu His Arg 
100 105 110 

Tyr Tyr Arg Asp His Val His Cys His Ala Arg Arg Phe Thr Arg Cys 
115 120 125 

Leu Tyr lie Ser Phe Gly Met lie Tyr Ala Leu Phe Leu Phe Gly Val 
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130 



135 



140 



Phe Val Gin Val lie Ser Gly Asn Trp Glu Leu Leu Tyr Pro Ala Tyr 
145 150 155 160 

Phe Pro Phe Asp Leu Glu Ser Asn Arg Phe Leu Gly Ala Val Ala Leu 
165 170 175 

Gly Tyr Gin Val Phe Ser Met Leu Val Glu Gly Phe Gin Gly Leu Gly 
180 185 190 

Asn Asp Thr Tyr Thr Pro Leu Thr Leu Cys Leu Leu Ala Gly His Val 
195 200 205 

His Leu Trp Ser lie Arg Met Gly Gin Leu Gly Tyr Phe Asp Asp Glu 
210 215 220 

Thr Val Val Asn His Gin Arg Leu Leu Asp Tyr He Glu Gin His Lys 
225 230 235 240 

Leu Leu Val Arg Phe His Asn Leu Val Ser Arg Thr He Ser Glu Val 
245 250 255 

Gin Leu Val Gin Leu Gly Gly Cys Gly Ala Thr Leu Cys lie He Val 
260 265 270 

Ser Tyr Met Leu Phe Phe Val Gly Asp Thr He Ser Leu Val Tyr Tyr 
275 280 285 

Leu Val Phe Phe Gly Val Val Cys Val Gin Leu Phe Pro Ser Cys Tyr 
290 295 300 

Phe Ala Ser Glu Val Ala Glu Glu Leu Glu Arg Leu Pro Tyr Ala He 
305 310 315 320 

Phe Ser Ser Arg Trp Tyr Asp Gin Ser Arg Asp His Arg Phe Asp Leu 
325 330 335 

Leu He Phe Thr Gin Leu Thr Leu Gly Asn Arg Gly Trp He He Lys 
340 345 350 

Ala Gly Gly Leu He Glu Leu Asn Leu Asn Ala Phe Phe Ala Thr Leu 
355 360 365 

Lys Met Ala Tyr Ser Leu Phe Ala Val Val Val Arg Ala Lys Gly lie 
370 375 380 
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<210> 17 
<211> 1152 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1149) 

<223> DOR 43B.1, coding region of AF127926 



<400> 17 

atg aca ate gag gat ate ggc ctg 
Met Thr lie Glu Asp lie Gly Leu 
1 5 

cga cac ttg gec gtg ctg tac ccc 
Arg His Leu Ala Val Leu Tyr Pro 
20 



gtg ggc ate aac gtg egg atg tgg 4 8 
Val Gly He Asn Val Arg Met Trp 
10 15 

act ccg ggc tec age tgg cgc aag 96 
Thr Pro Gly Ser Ser Trp Arg Lys 
25 30 



ttc gec ttc gtg ctg ccg gtg act gcg atg aat ctg atg cag ttc gtc 
Phe Ala Phe Val Leu Pro Val Thr Ala Met Asn Leu Met Gin Phe Val 
35 40 45 

tac ctg ctg egg atg tgg ggc gac ctg ccc gec ttc att ctg aac atg 
Tyr Leu Leu Arg Met Trp Gly Asp Leu Pro Ala Phe He Leu Asn Met 
50 55 60 

ttc ttc ttc teg gee att ttc aac gec ctg atg cgc acg tgg ctg gtc 
Phe Phe Phe Ser Ala He Phe Asn Ala Leu Met Arg Thr Trp Leu Val 
65 70 75 80 



ata ate aag egg cgc cag ttc gag gag ttt etc ggc caa ctg gee act 
lie He Lys Arg Arg Gin Phe Glu Glu Phe Leu Gly Gin Leu Ala Thr 
85 90 95 

ctg ttc cat teg att etc gac tec acc gac gag tgg ggg cgt ggc ate 
Leu Phe His Ser He Leu Asp Ser Thr Asp Glu Trp Gly Arg Gly He 
100 105 110 



ctg egg agg gcg gaa egg gag get egg aac ctg gec ate ctt aat ttg 
Leu Arg Arg Ala Glu Arg Glu Ala Arg Asn Leu Ala He Leu Asn Leu 
115 120 125 

agt gee tec ttc ctg gac att gtc ggt get ctg ttt ttc gaa tat aaa 
Ser Ala Ser Phe Leu Asp He Val Gly Ala Leu Phe Phe Glu Tyr Lys 
130 135 140 
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ttc cca att 
Phe Pro lie 
145 

tta get eta 
Leu Ala Leu 



ate tac ttg 
lie Tyr Leu 



atg cct ttc 
Met Pro Phe 
195 

atg ctg cag 
Met Leu Gin 
210 

cag teg gag 
Gin Ser Glu 
225 

cac acg cag 
His Thr Gin 



aac att gtg 
Asn lie Val 



ctg etc ttc 
Leu Leu Phe 
275 

ata gtg atg 
lie Val Met 
290 

aat egg gee 
Asn Arg Ala 
305 

gtt tac aat 
Val Tyr Asn 



ggt gtt gtc 
Gly Val Val 
150 

cca gga gtg 
Pro Gly Val 
165 

gee caa ctg 
Ala Gin Leu 
180 

gtc age ctt 
Val Ser Leu 



ate ctg gta 
lie Leu Val 



gag gag cgc 
Glu Glu Arg 
230 

gtg atg cgc 
Val Met Arg 
245 

gcg gtg gaa 
Ala Val Glu 
260 

tgt ctg aat 
Cys Leu Asn 



tac att ctg 
Tyr lie Leu 



aat gaa ata 
Asn Glu lie 
310 

gtg ccc tgg 
Val Pro Trp 
325 



act ttt ttc 
Thr Phe Phe 



age atg ace 
Ser Met Thr 



cct acg ccc 
Pro Thr Pro 
185 

ttt gee ggc 
Phe Ala Gly 
200 

cac agg ctg 
His Arg Leu 
215 

ttc caa agg 
Phe Gin Arg 



tat gtg tgg 
Tyr Val Trp 



gca att att 
Ala He He 
265 

att ata acc 
He He Thr 
280 

acc atg ctg 
Thr Met Leu 
295 

tgc etc gag 
Cys Leu Glu 



tac gag gca 
Tyr Glu Ala 



ctt cca get 
Leu Pro Ala 
155 

agt tea ccc 
Ser Ser Pro 
170 

ctg ctg ctg 
Leu Leu Leu 



ctg gee ate 
Leu Ala He 



ggc cag att 
Gly Gin He 
220 

ctg gec tec 
Leu Ala Ser 
235 

cag etc aac 
Gin Leu Asn 
250 

ttt ggc teg 
Phe Gly Ser 



tea ccc acc 
Ser Pro Thr 



tac gtt etc 
Tyr Val Leu 
300 

aac aac egg 
Asn Asn Arg 
315 

gga act egg 
Gly Thr Arg 
330 



cat ccc ttc 
His Pro Phe 



gtc tac gag 
Val Tyr Glu 
175 

tec atg atg 
Ser Met Met 
190 

ttt ggg aag 
Phe Gly Lys 
205 

ggc gga gaa 
Gly Gly Glu 



tgc att gcg 
Cys He Ala 



aaa ctg gtg 
Lys Leu Val 
255 

ata ate tgc 
He He Cys 
270 

cag gtg ate 
Gin Val He 
285 

ttc acc tac 
Phe Thr Tyr 



gtg gcg gag 
Val Ala Glu 



ttt cgc aaa 
Phe Arg Lys 
335 



ggc 480 

Gly 

160 

gtt 528 
Val 



tac 576 
Tyr 



gee 624 
Ala 



gag 672 
Glu 



tac 720 

Tyr 

240 

gee 7 68 
Ala 



tea 816 
Ser 



teg 864 
Ser 



tac 912 
Tyr 



get 960 

Ala 

320 

acc 1008 
Thr 
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etc ctg ate ttc ttg atg caa aca caa cac ccg atg gag ata aga gtc 1056 
Leu Leu lie Phe Leu Met Gin Thr Gin His Pro Met Glu lie Arg Val 
340 345 350 



ggc aac gtt tac ccc atg aca ttg gec atg ttc cag agt ctg ttg aat 1104 
Gly Asn Val Tyr Pro Met Thr Leu Ala Met Phe Gin Ser Leu Leu Asn 
355 360 365 

gcg tec tac tec tac ttt ace atg ctg cgt ggc gtc acc ggc aaa tga 1152 
Ala Ser Tyr Ser Tyr Phe Thr Met Leu Arg Gly Val Thr Gly Lys 
370 375 380 



<210> 18 
<211> 383 
<212> PRT 

<213> Drosophila melanogaster 
<400> 18 

Met Thr lie Glu Asp lie Gly Leu Val Gly lie Asn Val Arg Met Trp 
15 10 15 

Arg His Leu Ala Val Leu Tyr Pro Thr Pro Gly Ser Ser Trp Arg Lys 
20 25 30 

Phe Ala Phe Val Leu Pro Val Thr Ala Met Asn Leu Met Gin Phe Val 
35 40 45 

Tyr Leu Leu Arg Met Trp Gly Asp Leu Pro Ala Phe lie Leu Asn Met 
50 55 60 

Phe Phe Phe Ser Ala lie Phe Asn Ala Leu Met Arg Thr Trp Leu Val 
65 70 75 80 

He He Lys Arg Arg Gin Phe Glu Glu Phe Leu Gly Gin Leu Ala Thr 
85 90 95 

Leu Phe His Ser He Leu Asp Ser Thr Asp Glu Trp Gly Arg Gly He 
100 105 110 

Leu Arg Arg Ala Glu Arg Glu Ala Arg Asn Leu Ala He Leu Asn Leu 
115 120 125 

Ser Ala Ser Phe Leu Asp He Val Gly Ala Leu Phe Phe Glu Tyr Lys 
130 135 140 

Phe Pro He Gly Val Val Thr Phe Phe Leu Pro Ala His Pro Phe Gly 
145 150 155 160 

34 



IM 'l l 'HI 



Leu Ala Leu Pro Gly Val Ser Met Thr Ser Ser Pro Val Tyr Glu Val 
165 170 175 



lie Tyr Leu Ala Gin Leu Pro Thr Pro Leu Leu Leu Ser Met Met Tyr 
180 185 190 

Met Pro Phe Val Ser Leu Phe Ala Gly Leu Ala lie Phe Gly Lys Ala 
195 200 205 

Met Leu Gin He Leu Val His Arg Leu Gly Gin He Gly Gly Glu Glu 
210 215 220 

Gin Ser Glu Glu Glu Arg Phe Gin Arg Leu Ala Ser Cys He Ala Tyr 
225 230 235 240 

His Thr Gin Val Met Arg Tyr Val Trp Gin Leu Asn Lys Leu Val Ala 
245 250 255 

Asn He Val Ala Val Glu Ala lie lie Phe Gly Ser He lie Cys Ser 
260 265 270 

Leu Leu Phe Cys Leu Asn lie He Thr Ser Pro Thr Gin Val He Ser 
275 280 285 

He Val Met Tyr He Leu Thr Met Leu Tyr Val Leu Phe Thr Tyr Tyr 
290 295 300 

Asn Arg Ala Asn Glu He Cys Leu Glu Asn Asn Arg Val Ala Glu Ala 
305 310 315 320 

Val Tyr Asn Val Pro Trp Tyr Glu Ala Gly Thr Arg Phe Arg Lys Thr 
325 330 335 

Leu Leu He Phe Leu Met Gin Thr Gin His Pro Met Glu He Arg Val 
340 345 350 

Gly Asn Val Tyr Pro Met Thr Leu Ala Met Phe Gin Ser Leu Leu Asn 
355 360 365 

Ala Ser Tyr Ser Tyr Phe Thr Met Leu Arg Gly Val Thr Gly Lys 
370 375 380 



<210> 19 
<211> 1158 
<212> DNA 



35 



<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1155) 

<223> DOR 46F.1, a coding region on BDGP Clone No. 
AC005974 



atg age aaa gga gta gaa ate ttt tac aag ggc cag aag gca ttc ttg 
Met Ser Lys Gly Val Glu lie Phe Tyr Lys Gly Gin Lys Ala Phe Leu 
15 10 15 

aac ate etc teg ttg tgg cct cag ata gaa cgc egg tgg aga ate ate 
Asn lie Leu Ser Leu Trp Pro Gin He Glu Arg Arg Trp Arg He He 
20 25 30 

cac cag gtg aac tat gtc cac gta att gtg ttt tgg gtg ctg etc ttt 
His Gin Val Asn Tyr Val His Val He Val Phe Trp Val Leu Leu Phe 
35 40 45 

gat etc etc ttg gtg etc cat gtg atg get aat ttg age tac atg tec 
Asp Leu Leu Leu Val Leu His Val Met Ala Asn Leu Ser Tyr Met Ser 



gag gtt gtg aaa gec ate ttt ate ctg gec acc agt gca ggg cac acc 
Glu Val Val Lys Ala He Phe He Leu Ala Thr Ser Ala Gly His Thr 
65 70 75 80 

acc aag ctg ctg tec ata aag gcg aac aat gtg cag atg gag gag etc 
Thr Lys Leu Leu Ser He Lys Ala Asn Asn Val Gin Met Glu Glu Leu 



ttt agg aga ttg gat aac gaa gag ttc cgt cct aga ggc gec aac gaa 
Phe Arg Arg Leu Asp Asn Glu Glu Phe Arg Pro Arg Gly Ala Asn Glu 
100 105 110 

gag ttg ate ttt gca gca gee tgt gaa aga agt agg aag ctt egg gac 
Glu Leu lie Phe Ala Ala Ala Cys Glu Arg Ser Arg Lys Leu Arg Asp 
115 120 125 

ttc tat gga gcg ctt teg ttt gec gec ttg age atg att etc ata ccc 
Phe Tyr Gly Ala Leu Ser Phe Ala Ala Leu Ser Met He Leu He Pro 
130 135 140 

cag ttc gec ttg gac tgg tec cac ctt ccg etc aaa aca tac aat ccg 
Gin Phe Ala Leu Asp Trp Ser His Leu Pro Leu Lys Thr Tyr Asn Pro 
145 150 155 160 

36 



ctt ggc gag aat acc ggc tea cct get tat tgg etc etc tac tgc tat 
Leu Gly Glu Asn Thr Gly Ser Pro Ala Tyr Trp Leu Leu Tyr Cys Tyr 
165 170 175 

cag tgt ctg gee ttg tec gta tec tgc ate acc aac ata gga ttc gac 
Gin Cys Leu Ala Leu Ser Val Ser Cys lie Thr Asn lie Gly Phe Asp 
180 185 190 

tea etc tgc tec tea ctg ttc ate ttc etc aag tgc cag ctg gac att 
Ser Leu Cys Ser Ser Leu Phe lie Phe Leu Lys Cys Gin Leu Asp lie 
195 200 205 

ctg gec gtg cga ctg gac aag ate ggt egg tta ate act act tct ggt 
Leu Ala Val Arg Leu Asp Lys He Gly Arg Leu He Thr Thr Ser Gly 
210 215 220 

ggc act gtg gaa cag caa ctt aag gaa aat ate cgc tat cac atg acc 
Gly Thr Val Glu Gin Gin Leu Lys Glu Asn He Arg Tyr His Met Thr 
225 230 235 240 

ate gtt gaa ctg teg aaa acc gtg gag cgt eta ctt tgc aag ccg att 
He Val Glu Leu Ser Lys Thr Val Glu Arg Leu Leu Cys Lys Pro He 
245 250 255 

teg gtg cag ate ttc tgc teg gtt ttg gtg ctg act gec aat ttc tat 
Ser Val Gin He Phe Cys Ser Val Leu Val Leu Thr Ala Asn Phe Tyr 
260 265 270 

gec att get gtg tta tct gac gag agg ctg gag etc ttt aag tat gtg 
Ala He Ala Val Leu Ser Asp Glu Arg Leu Glu Leu Phe Lys Tyr Val 
275 280 285 

acc tat cag gcg tgc atg ttg att cag att ttt ata ttg tgc tac tat 
Thr Tyr Gin Ala Cys Met Leu He Gin He Phe He Leu Cys Tyr Tyr 
290 295 300 

gec ggt gag gta acc cag cgc age ctg gac ctt ccg cac gag ctg tac 
Ala Gly Glu Val Thr Gin Arg Ser Leu Asp Leu Pro His Glu Leu Tyr 
305 310 315 320 

aag acc tec tgg gtg gac tgg gac tac agg age cga agg att gcg etc 
Lys Thr Ser Trp Val Asp Trp Asp Tyr Arg Ser Arg Arg He Ala Leu 
325 330 335 

etc ttt atg caa cgc ctt cac teg acc ttg agg att agg aca ctt aat 
Leu Phe Met Gin Arg Leu His Ser Thr Leu Arg He Arg Thr Leu Asn 
340 345 350 



37 



cca agt ctt ggt ttt gac tta atg etc ttc age teg gtg agt tct ttc 
Pro Ser Leu Gly Phe Asp Leu Met Leu Phe Ser Ser Val Ser Ser Phe 
355 360 365 



1104 



cgt gtt ttg act ttt ttg tgc act gta gec aat ttc cat aat gag get 1152 
Arg Val Leu Thr Phe Leu Cys Thr Val Ala Asn Phe His Asn Glu Ala 
370 375 380 

cat tag 1158 

His 

385 



<210> 20 
<211> 385 
<212> PRT 

<213> Drosophila melanogaster 
<400> 20 

Met Ser Lys Gly Val Glu lie Phe Tyr Lys Gly Gin Lys Ala Phe Leu 



Asn lie Leu Ser Leu Trp Pro Gin lie Glu Arg Arg Trp Arg lie lie 



His Gin Val Asn Tyr Val His Val lie Val Phe Trp Val Leu Leu Phe 
35 40 45 

Asp Leu Leu Leu Val Leu His Val Met Ala Asn Leu Ser Tyr Met Ser 



Glu Val Val Lys Ala lie Phe lie Leu Ala Thr Ser Ala Gly His Thr 
65 70 75 80 

Thr Lys Leu Leu Ser lie Lys Ala Asn Asn Val Gin Met Glu Glu Leu 



Phe Arg Arg Leu Asp Asn Glu Glu Phe Arg Pro Arg Gly Ala Asn Glu 
100 105 110 

Glu Leu He Phe Ala Ala Ala Cys Glu Arg Ser Arg Lys Leu Arg Asp 
115 120 125 

Phe Tyr Gly Ala Leu Ser Phe Ala Ala Leu Ser Met He Leu He Pro 
130 135 140 

Gin Phe Ala Leu Asp Trp Ser His Leu Pro Leu Lys Thr Tyr Asn Pro 



38 



145 



150 



155 



160 



Leu Gly Glu Asn Thr Gly Ser Pro Ala Tyr Trp Leu Leu Tyr Cys Tyr 
165 170 175 

Gin Cys Leu Ala Leu Ser Val Ser Cys lie Thr Asn lie Gly Phe Asp 
180 185 190 

Ser Leu Cys Ser Ser Leu Phe lie Phe Leu Lys Cys Gin Leu Asp lie 
195 200 205 

Leu Ala Val Arg Leu Asp Lys lie Gly Arg Leu lie Thr Thr Ser Gly 
210 215 220 

Gly Thr Val Glu Gin Gin Leu Lys Glu Asn lie Arg Tyr His Met Thr 
225 230 235 240 

lie Val Glu Leu Ser Lys Thr Val Glu Arg Leu Leu Cys Lys Pro lie 
245 250 255 

Ser Val Gin He Phe Cys Ser Val Leu Val Leu Thr Ala Asn Phe Tyr 
260 265 270 

Ala He Ala Val Leu Ser Asp Glu Arg Leu Glu Leu Phe Lys Tyr Val 
275 280 285 

Thr Tyr Gin Ala Cys Met Leu He Gin He Phe He Leu Cys Tyr Tyr 
290 295 300 

Ala Gly Glu Val Thr Gin Arg Ser Leu Asp Leu Pro His Glu Leu Tyr 
305 310 315 320 

Lys Thr Ser Trp Val Asp Trp Asp Tyr Arg Ser Arg Arg He Ala Leu 
325 330 335 

Leu Phe Met Gin Arg Leu His Ser Thr Leu Arg He Arg Thr Leu Asn 
340 345 350 

Pro Ser Leu Gly Phe Asp Leu Met Leu Phe Ser Ser Val Ser Ser Phe 
355 360 365 

Arg Val Leu Thr Phe Leu Cys Thr Val Ala Asn Phe His Asn Glu Ala 
370 375 380 

His 
385 
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<210> 21 
<211> 1155 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1152) 

<223> DOR 46F.2, a coding region on BDGP Clone No. 
AC005974 



atg gtt acg gag gac ttt tat aag tac cag gtg tgg tac ttc caa ate 
Met Val Thr Glu Asp Phe Tyr Lys Tyr Gin Val Trp Tyr Phe Gin He 



ctt ggt gtt tgg cag etc ccc act tgg gec gca gac cac cag cgt cgt 
Leu Gly Val Trp Gin Leu Pro Thr Trp Ala Ala Asp His Gin Arg Arg 



ttt cag tec atg agg ttt ggc ttc ate ctg gtc ate ctg ttc ate atg 
Phe Gin Ser Met Arg Phe Gly Phe He Leu Val He Leu Phe He Met 
35 40 45 

ctg ctg ctt ttc tec ttc gaa atg ttg aac aac att tec caa gtt agg 
Leu Leu Leu Phe Ser Phe Glu Met Leu Asn Asn He Ser Gin Val Arg 
50 55 60 

gag ate eta aag gta ttc ttc atg ttc gec acg gaa ata tec tgc atg 
Glu He Leu Lys Val Phe Phe Met Phe Ala Thr Glu He Ser Cys Met 



gec aaa tta ttg cat ttg aag ttg aag age cgc aaa etc get ggc ttg 
Ala Lys Leu Leu His Leu Lys Leu Lys Ser Arg Lys Leu Ala Gly Leu 
85 90 95 

gtt gat gcg atg ttg tec cca gag ttc ggc gtt aaa agt gaa cag gaa 
Val Asp Ala Met Leu Ser Pro Glu Phe Gly Val Lys Ser Glu Gin Glu 
100 105 110 

atg cag atg ctg gaa ttg gat aga gtg gcg gtt gtc cgc atg agg aac 
Met Gin Met Leu Glu Leu Asp Arg Val Ala Val Val Arg Met Arg Asn 
115 120 125 

tec tac ggc ate atg tec ctg ggc gcg get tec ctg ate ctt ata gtt 
Ser Tyr Gly He Met Ser Leu Gly Ala Ala Ser Leu He Leu He Val 
130 135 140 



40 



ccc tgt ttc gac aac ttt ggc gag eta cca ctg gec atg ttg gag gta 
Pro Cys Phe Asp Asn Phe Gly Glu Leu Pro Leu Ala Met Leu Glu Val 
145 150 155 160 

tgc age ate gag gga tgg ate tgc tat tgg teg cag tac ctt ttc cac 
Cys Ser lie Glu Gly Trp lie Cys Tyr Trp Ser Gin Tyr Leu Phe His 
165 170 175 

teg att tgc ctg ctg ccc act tgt gtg ctg aat ata acc tac gac teg 
Ser lie Cys Leu Leu Pro Thr Cys Val Leu Asn lie Thr Tyr Asp Ser 
180 185 190 

gtg gec tac teg ttg etc tgt ttc ttg aag gtt cag eta caa atg ctg 
Val Ala Tyr Ser Leu Leu Cys Phe Leu Lys Val Gin Leu Gin Met Leu 
195 200 205 

gtc ctg cga tta gaa aag ttg ggt cct gtg ate gaa ccc cag gat aat 
Val Leu Arg Leu Glu Lys Leu Gly Pro Val lie Glu Pro Gin Asp Asn 
210 215 220 

gag aaa ate gca atg gaa ctg cgt gag tgt gee gec tac tac aac agg 
Glu Lys lie Ala Met Glu Leu Arg Glu Cys Ala Ala Tyr Tyr Asn Arg 
225 230 235 240 

att gtt cgt ttc aag gac ctg gtg gag ctg ttc ata aag ggg cca gga 
lie Val Arg Phe Lys Asp Leu Val Glu Leu Phe lie Lys Gly Pro Gly 
245 250 . 255 

tct gtg cag etc atg tgt tct gtt ctg gtg ctg gtg tec aac ctg tac 
Ser Val Gin Leu Met Cys Ser Val Leu Val Leu Val Ser Asn Leu Tyr 
260 265 270 

gac atg tec acc atg tec att gca aac ggc gat gec ate ttt atg etc 
Asp Met Ser Thr Met Ser lie Ala Asn Gly Asp Ala lie Phe Met Leu 
275 280 285 

aag acc tgt ate tat cag ctg gtg atg etc tgg cag ate ttc ate att 
Lys Thr Cys lie Tyr Gin Leu Val Met Leu Trp Gin lie Phe lie He 
290 295 300 

tgc tac gec tec aac gag gta act gtc cag age tct agg ttg tgt cac 
Cys Tyr Ala Ser Asn Glu Val Thr Val Gin Ser Ser Arg Leu Cys His 
305 310 315 320 

age ate tac age tec caa tgg acg gga tgg aac agg gca aac cgc egg 
Ser He Tyr Ser Ser Gin Trp Thr Gly Trp Asn Arg Ala Asn Arg Arg 
325 330 335 



41 



att gtc 
He Val 



ctt 
Leu 



etc atg atg cag cgc ttt aat tec ccg atg etc ctg age 
Leu Met Met Gin Arg Phe Asn Ser Pro Met Leu Leu Ser 
340 345 350 



1056 



ace ttt aac ccc acc ttt get ttc age ttg gag gee ttt ggt tct ate 1104 
Thr Phe Asn Pro Thr Phe Ala Phe Ser Leu Glu Ala Phe Gly Ser He 
355 360 365 



gtc aac tgc tec tac age tac ttc gca ctg ctg aag cgc gtc aac ag 
Val Asn Cys Ser Tyr Ser Tyr Phe Ala Leu Leu Lys Arg Val Asn Se 
370 375 380 



1152 



<210> 22 
<211> 384 
<212> PRT 

<213> Drosophila melanogaster 
<400> 22 

Met Val Thr Glu Asp Phe Tyr Lys Tyr Gin Val Trp Tyr Phe Gin He 



Leu Gly Val Trp Gin Leu Pro Thr Trp Ala Ala Asp His Gin Arg Arg 
20 25 30 

Phe Gin Ser Met Arg Phe Gly Phe He Leu Val He Leu Phe He Met 



Leu Leu Leu Phe Ser Phe Glu Met Leu Asn Asn He Ser Gin Val Arg 
50 55 60 

Glu He Leu Lys Val Phe Phe Met Phe Ala Thr Glu He Ser Cys Met 
65 70 75 80 

Ala Lys Leu Leu His Leu Lys Leu Lys Ser Arg Lys Leu Ala Gly Leu 



Val Asp Ala Met Leu Ser Pro Glu Phe Gly Val Lys Ser Glu Gin Glu 
100 105 110 

Met Gin Met Leu Glu Leu Asp Arg Val Ala Val Val Arg Met Arg Asn 
115 120 125 

Ser Tyr Gly He Met Ser Leu Gly Ala Ala Ser Leu He Leu He Val 
130 135 140 



42 



Pro Cys Phe Asp Asn Phe Gly Glu Leu Pro Leu Ala Met Leu Glu Val 
145 150 155 160 



Cys Ser lie Glu Gly Trp lie Cys Tyr Trp Ser Gin Tyr Leu Phe His 
165 170 175 

Ser lie Cys Leu Leu Pro Thr Cys Val Leu Asn lie Thr Tyr Asp Ser 
180 185 190 

Val Ala Tyr Ser Leu Leu Cys Phe Leu Lys Val Gin Leu Gin Met Leu 
195 200 205 

Val Leu Arg Leu Glu Lys Leu Gly Pro Val lie Glu Pro Gin Asp Asn 
210 215 220 

Glu Lys He Ala Met Glu Leu Arg Glu Cys Ala Ala Tyr Tyr Asn Arg 
225 230 235 240 

He Val Arg Phe Lys Asp Leu Val Glu Leu Phe He Lys Gly Pro Gly 
245 250 255 

Ser Val Gin Leu Met Cys Ser Val Leu Val Leu Val Ser Asn Leu Tyr 
260 265 270 

Asp Met Ser Thr Met Ser He Ala Asn Gly Asp Ala He Phe Met Leu 
275 280 285 

Lys Thr Cys He Tyr Gin Leu Val Met Leu Trp Gin He Phe He He 
290 295 300 

Cys Tyr Ala Ser Asn Glu Val Thr Val Gin Ser Ser Arg Leu Cys His 
305 310 315 320 

Ser He Tyr Ser Ser Gin Trp Thr Gly Trp Asn Arg Ala Asn Arg Arg 
325 330 335 

He Val Leu Leu Met Met Gin Arg Phe Asn Ser Pro Met Leu Leu Ser 
340 345 350 

Thr Phe Asn Pro Thr Phe Ala Phe Ser Leu Glu Ala Phe Gly Ser He 
355 360 365 

Val Asn Cys Ser Tyr Ser Tyr Phe Ala Leu Leu Lys Arg Val Asn Ser 
370 - 375 380 
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<210> 23 
<211> 1158 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1155) 

<223> DOR 47E.1, coding region of AF156880 
<400> 23 

atg gac agt ttt ctg caa gta cag aag age acc att gec ctt ctg ggc 
Met Asp Ser Phe Leu Gin Val Gin Lys Ser Thr lie Ala Leu Leu Gly 
15 10 15 

ttt gat etc ttt agt gaa aat cga gaa atg tgg aaa cgc ccc tat aga 
Phe Asp Leu Phe Ser Glu Asn Arg Glu Met Trp Lys Arg Pro Tyr Arg 
20 25 30 

gca atg aat gtg ttt age ata get gee att ttt ccc ttt ate ctg gca 
Ala Met Asn Val Phe Ser lie Ala Ala lie Phe Pro Phe lie Leu Ala 



get gtg etc cat aat tgg aag aat gta ttg ctg ctg gee gat gee atg 
Ala Val Leu His Asn Trp Lys Asn Val Leu Leu Leu Ala Asp Ala Met 



gtg gec eta eta ata acc att ctg ggc eta ttc aag ttt age atg ata 
Val Ala Leu Leu lie Thr lie Leu Gly Leu Phe Lys Phe Ser Met lie 
65 70 75 80 

ctt tac tta cgt cgc gat ttc aag cga ctg att gac aaa ttt cgt ttg 
Leu Tyr Leu Arg Arg Asp Phe Lys Arg Leu He Asp Lys Phe Arg Leu 
85 90 95 

etc atg teg aat gag gcg gaa cag ggc gag gaa tac gee gag att etc 
Leu Met Ser Asn Glu Ala Glu Gin Gly Glu Glu Tyr Ala Glu He Leu 
100 105 110 

aac gca gca aac aag cag gat caa cga atg tgc act ctg ttt agg act 
Asn Ala Ala Asn Lys Gin Asp Gin Arg Met Cys Thr Leu Phe Arg Thr 
115 120 125 

tgt ttc etc etc gee tgg gec ttg aat agt gtt ctg ccc etc gtg aga 
Cys Phe Leu Leu Ala Trp Ala Leu Asn Ser Val Leu Pro Leu Val Arg 
130 135 140 

atg ggt etc age tat tgg tta gca ggt cat gca gag ccc gag ttg cct 



44 



Met Gly Leu Ser Tyr Trp Leu Ala Gly His Ala Glu Pro Glu Leu Pro 
145 150 155 160 



ttt ccc tgt ctt ttt ccc tgg aat ate cac ate att cgc aat tat gtt 
Phe Pro Cys Leu Phe Pro Trp Asn lie His lie lie Arg Asn Tyr Val 
165 170 175 

ttg age ttc ate tgg age get ttc gee teg aca ggt gtg gtt tta cct 
Leu Ser Phe He Trp Ser Ala Phe Ala Ser Thr Gly Val Val Leu Pro 
180 185 190 

get gtc age ttg gat ace ata ttc tgt tec ttc ace age aac ctg tgc 
Ala Val Ser Leu Asp Thr He Phe Cys Ser Phe Thr Ser Asn Leu Cys 
195 200 205 

gee ttc ttc aaa att gcg cag tac aag gtg gtt aga ttt aag ggc gga 
Ala Phe Phe Lys He Ala Gin Tyr Lys Val Val Arg Phe Lys Gly Gly 
210 215 220 

tec ctt aaa gaa tea cag gec aca ttg aac aaa gtc ttt gec ctg tac 
Ser Leu Lys Glu Ser Gin Ala Thr Leu Asn Lys Val Phe Ala Leu Tyr 
225 230 235 240 

cag acc age ttg gat atg tgc aac gat ctg aat cag tgc tac caa ccg 
Gin Thr Ser Leu Asp Met Cys Asn Asp Leu Asn Gin Cys Tyr Gin Pro 
245 250 255 

att ate tgc gec cag ttc ttc att tea tct ctg caa etc tgc atg ctg 
He He Cys Ala Gin Phe Phe He Ser Ser Leu Gin Leu Cys Met Leu 
260 265 270 

gga tat ctg ttc tec att act ttt gee cag aca gag ggc gtc tac tat 
Gly Tyr Leu Phe Ser lie Thr Phe Ala Gin Thr Glu Gly Val Tyr Tyr 
275 280 285 

gec tea ttc ata gec aca ate att ata caa gec tat ate tac tgc tac 
Ala Ser Phe He Ala Thr He He He Gin Ala Tyr He Tyr Cys Tyr 
290 295 300 

tgc ggg gag aac ctg aag acg gag agt gec age ttc gag tgg gec ate 
Cys Gly Glu Asn Leu Lys Thr Glu Ser Ala Ser Phe Glu Trp Ala He 
305 310 315 320 

tac gac agt ccg tgg cac gag agt ttg ggt get ggt gga gee tct acc 
Tyr Asp Ser Pro Trp His Glu Ser Leu Gly Ala Gly Gly Ala Ser Thr 
325 330 335 

teg ate tgc cga tec ttg ctg ate age atg atg egg get cat egg gga 



45 



Ser lie Cys Arg Ser Leu 
340 

ttc cgc att acg gga tac 
Phe Arg lie Thr Gly Tyr 
355 

teg att gtt cgc acg gcg 
Ser He Val Arg Thr Ala 
370 

tec taa 

Ser 

385 



Leu He Ser Met Met Arg 
345 

ttt ttc gag gca aac atg 
Phe Phe Glu Ala Asn Met 
360 

atg tec tac ate aca atg 
Met Ser Tyr He Thr Met 
375 380 



Ala His Arg Gly 
350 

gag gee ttc tea 1104 

Glu Ala Phe Ser 

365 

ctg aga tea ttc 1152 
Leu Arg Ser Phe 



1158 



<210> 24 
<211> 385 
<212> PRT 

<213> Drosophila melanogaster 
<400> 24 

Met Asp Ser Phe Leu Gin Val Gin Lys Ser Thr He Ala Leu Leu Gly 



Phe Asp Leu Phe Ser Glu Asn Arg Glu Met Trp Lys Arg Pro Tyr Arg 
20 25 30 

Ala Met Asn Val Phe Ser He Ala Ala He Phe Pro Phe He Leu Ala 



Ala Val Leu His Asn Trp Lys Asn Val Leu Leu Leu Ala Asp Ala Met 
50 55 60 

Val Ala Leu Leu He Thr He Leu Gly Leu Phe Lys Phe Ser Met He 
65 70 75 80 

Leu Tyr Leu Arg Arg Asp Phe Lys Arg Leu He Asp Lys Phe Arg Leu 



Leu Met Ser Asn Glu Ala Glu Gin Gly Glu Glu Tyr Ala Glu He Leu 
100 105 110 

Asn Ala Ala Asn Lys Gin Asp Gin Arg Met Cys Thr Leu Phe Arg Thr 
115 120 125 

Cys Phe Leu Leu Ala Trp Ala Leu Asn Ser Val Leu Pro Leu Val Arg 
130 135 140 



Met Gly Leu Ser Tyr Trp Leu Ala Gly His Ala Glu Pro Glu Leu Pro 
145 150 155 160 



Phe Pro Cys Leu Phe Pro Trp Asn lie His lie lie Arg Asn Tyr Val 
165 170 175 

Leu Ser Phe lie Trp Ser Ala Phe Ala Ser Thr Gly Val Val Leu Pro 
180 185 190 

Ala Val Ser Leu Asp Thr lie Phe Cys Ser Phe Thr Ser Asn Leu Cys 
195 200 205 

Ala Phe Phe Lys lie Ala Gin Tyr Lys Val Val Arg Phe Lys Gly Gly 
210 215 220 

Ser Leu Lys Glu Ser Gin Ala Thr Leu Asn Lys Val Phe Ala Leu Tyr 
225 230 235 240 

Gin Thr Ser Leu Asp Met Cys Asn Asp Leu Asn Gin Cys Tyr Gin Pro 
245 250 255 

lie lie Cys Ala Gin Phe Phe lie Ser Ser Leu Gin Leu Cys Met Leu 
260 265 270 

Gly Tyr Leu Phe Ser lie Thr Phe Ala Gin Thr Glu Gly Val Tyr Tyr 
275 280 285 

Ala Ser Phe lie Ala Thr lie lie lie Gin Ala Tyr lie Tyr Cys Tyr 
290 295 300 

Cys Gly Glu Asn Leu Lys Thr Glu Ser Ala Ser Phe Glu Trp Ala lie 
305 310 315 320 

Tyr Asp Ser Pro Trp His Glu Ser Leu Gly Ala Gly Gly Ala Ser Thr 
325 330 335 

Ser lie Cys Arg Ser Leu Leu He Ser Met Met Arg Ala His Arg Gly 
340 345 350 

Phe Arg He Thr Gly Tyr Phe Phe Glu Ala Asn Met Glu Ala Phe Ser 
355 360 365 

Ser He Val Arg Thr Ala Met Ser Tyr He Thr Met Leu Arg Ser Phe 
370 375 380 

Ser 
385 
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<210> 25 
<211> 1203 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1200) 

<223> DOR 4 7E.2, a coding region on BDGP Clone No. 
AC005638 

<400> 25 

atg aac gac teg ggt tat caa tea aat etc age ctt ctg egg gtt ttt 48 
Met Asn Asp Ser Gly Tyr Gin Ser Asn Leu Ser Leu Leu Arg Val Phe 
15 10 15 

etc gac gag ttc cga teg gtt ctg egg cag gaa agt ccc ggt etc ate 96 
Leu Asp Glu Phe Arg Ser Val Leu Arg Gin Glu Ser Pro Gly Leu lie 
20 25 30 

cca cgc ctg get ttt tac tat gtt cgc gee ttt ctg age ttg ccc ctg 144 
Pro Arg Leu Ala Phe Tyr Tyr Val Arg Ala Phe Leu Ser Leu Pro Leu 
35 40 45 

tac cga tgg ate aac ttg ttc ate atg tgc aat gtg atg acc att ttc 192 
Tyr Arg Trp lie Asn Leu Phe lie Met Cys Asn Val Met Thr lie Phe 
50 55 60 

tgg acc atg ttc gtg gec ctg ccc gag teg aag aac gtg ate gaa atg 240 
Trp Thr Met Phe Val Ala Leu Pro Glu Ser Lys Asn Val lie Glu Met 
65 70 75 80 

ggc gac gac ttg gtt tgg att teg ggg atg gca ctg gtg ttc acc aag 288 
Gly Asp Asp Leu Val Trp lie Ser Gly Met Ala Leu Val Phe Thr Lys 
85 90 95 

ate ttt tac atg cat ttg cgt tgc gac gag ate gat gaa ctt att teg 336 
lie Phe Tyr Met His Leu Arg Cys Asp Glu lie Asp Glu Leu lie Ser 
100 105 110 

gat ttt gaa tac tac aac egg gag ctg aga ccc cat aat ate gat gag 384 
Asp Phe Glu Tyr Tyr Asn Arg Glu Leu Arg Pro His Asn lie Asp Glu 
115 120 125 



gag gtg ttg ggt tgg cag aga ctg tgc tac gtg ata gaa teg ggt eta 432 



Glu Val Leu Gly Trp Gin Arg Leu Cys Tyr Val lie Glu Ser Gly Leu 
130 135 140 



tat ate aac tgc ttt tgc ctg gtc aac ttc ttc agt gec get att ttc 480 
Tyr lie Asn Cys Phe Cys Leu Val Asn Phe Phe Ser Ala Ala lie Phe 
145 150 155 160 



ctg caa cct ctg ttg ggc gag gga aag ctg ccc ttc cac age gtc tat 528 
Leu Gin Pro Leu Leu Gly Glu Gly Lys Leu Pro Phe His Ser Val Tyr 
165 170 175 



ccg ttt caa tgg cat cgc ttg gat ctg cat ccc tac acg ttc tgg ttc 57 6 
Pro Phe Gin Trp His Arg Leu Asp Leu His Pro Tyr Thr Phe Trp Phe 
180 185 190 



etc tac ate tgg cag agt ctg ace teg cag cac aac eta atg age att 624 
Leu Tyr lie Trp Gin Ser Leu Thr Ser Gin His Asn Leu Met Ser lie 
195 200 205 



eta atg gtg gat atg gta 
Leu Met Val Asp Met Val 
210 



ggc att tec acg ttc 
Gly He Ser Thr Phe 
215 



etc cag acg gcg etc 672 

Leu Gin Thr Ala Leu 

220 



aat etc aag ttg ctt tgc ate gag ata agg aaa ctg ggg gac atg gag 720 
Asn Leu Lys Leu Leu Cys He Glu He Arg Lys Leu Gly Asp Met Glu 
225 230 235 240 



gtc agt gat aag agg ttc cac gag gag ttt tgt cgt gtg gtt cgc ttc 7 68 
Val Ser Asp Lys Arg Phe His Glu Glu Phe Cys Arg Val Val Arg Phe 
245 250 255 



cac cag cac att ate aaa ttg gtg ggg aaa gee aat aga get ttc aat 
His Gin His He He Lys Leu Val Gly Lys Ala Asn Arg Ala Phe Asn 
260 265 270 

ggc gec ttc aat gca caa tta atg gec agt ttc tec ctg att tec ata 
Gly Ala Phe Asn Ala Gin Leu Met Ala Ser Phe Ser Leu He Ser He 
275 280 285 

tec act ttc gag acc atg get gca gcg get gtg gat ccc aaa atg gec 
Ser Thr Phe Glu Thr Met Ala Ala Ala Ala Val Asp Pro Lys Met Ala 
290 295 300 

gec aag ttc gtg ctt etc atg ctg gtg gca ttc att caa ctg teg ctt 
Ala Lys Phe Val Leu Leu Met Leu Val Ala Phe He Gin Leu Ser Leu 
305 310 315 320 

tgg tgc gtc tct gga act ttg gtt tat act cag tea gtg gag gtg get 
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Trp Cys Val Ser Gly Thr Leu Val Tyr Thr Gin Ser Val Glu Val Ala 
325 330 335 



cag get get ttt gat ate aac gat tgg cac acc aaa teg cca ggc ate 
Gin Ala Ala Phe Asp lie Asm Asp Trp His Thr Lys Ser Pro Gly lie 
340 345 350 

cag agg gat ata tec ttt gtg ata eta cga gec cag aaa ccc ctg atg 
Gin Arg Asp lie Ser Phe Val lie Leu Arg Ala Gin Lys Pro Leu Met 
355 360 365 

tat gtg gee gaa cca ttt ctg ccc ttc acc ctg gga acc tat atg ctt 
Tyr Val Ala Glu Pro Phe Leu Pro Phe Thr Leu Gly Thr Tyr Met Leu 
370 375 380 

gtt ctg aag aac tgc tat cgt ttg ctg gec ctg atg caa gaa teg atg 
Val Leu Lys Asn Cys Tyr Arg Leu Leu Ala Leu Met Gin Glu Ser Met 
385 390 395 400 



<210> 26 
<211> 400 
<212> PRT 

<213> Drosophila melanogaster 
<400> 26 

Met Asn Asp Ser Gly Tyr Gin Ser Asn Leu Ser Leu Leu Arg Val Phe 
15 10 15 

Leu Asp Glu Phe Arg Ser Val Leu Arg Gin Glu Ser Pro Gly Leu lie 
20 25 30 

Pro Arg Leu Ala Phe Tyr Tyr Val Arg Ala Phe Leu Ser Leu Pro Leu 
35 40 45 

Tyr Arg Trp lie Asn Leu Phe lie Met Cys Asn Val Met Thr lie Phe 
50 55 60 

Trp Thr Met Phe Val Ala Leu Pro Glu Ser Lys Asn Val lie Glu Met 
65 70 75 80 

Gly Asp Asp Leu Val Trp lie Ser Gly Met Ala Leu Val Phe Thr Lys 
85 90 95 

lie Phe Tyr Met His Leu Arg Cys Asp Glu lie Asp Glu Leu lie Ser 
100 105 110 
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Asp Phe Glu Tyr Tyr Asn Arg Glu Leu Arg Pro His Asn lie Asp Glu 
115 120 125 



Glu Val Leu Gly Trp 
130 

Tyr lie Asn Cys Phe 
145 

Leu Gin Pro Leu Leu 
165 

Pro Phe Gin Trp His 
180 

Leu Tyr lie Trp Gin 
195 

Leu Met Val Asp Met 
210 

Asn Leu Lys Leu Leu 
225 

Val Ser Asp Lys Arg 
245 

His Gin His He He 
260 

Gly Ala Phe Asn Ala 
275 

Ser Thr Phe Glu Thr 
290 

Ala Lys Phe Val Leu 

305 

Trp Cys Val Ser Gly 
325 

Gin Ala Ala Phe Asp 
340 

Gin Arg Asp He Ser 
355 



Gin Arg Leu Cys Tyr Val 
135 

Cys Leu Val Asn Phe Phe 

150 155 

Gly Glu Gly Lys Leu Pro 
170 

Arg Leu Asp Leu His Pro 
185 

Ser Leu Thr Ser Gin His 
200 

Val Gly He Ser Thr Phe 
215 

Cys He Glu He Arg Lys 
230 235 

Phe His Glu Glu Phe Cys 

250 

Lys Leu Val Gly Lys Ala 
265 

Gin Leu Met Ala Ser Phe 
280 

Met Ala Ala Ala Ala Val 
295 

Leu Met Leu Val Ala Phe 
310 315 

Thr Leu Val Tyr Thr Gin 
330 

He Asn Asp Trp His Thr 
345 

Phe Val He Leu Arg Ala 
360 



He Glu Ser Gly Leu 
140 

Ser Ala Ala He Phe 
160 

Phe His Ser Val Tyr 
175 

Tyr Thr Phe Trp Phe 
190 

Asn Leu Met Ser He 
205 

Leu Gin Thr Ala Leu 
220 

Leu Gly Asp Met Glu 
240 

Arg Val Val Arg Phe 
255 

Asn Arg Ala Phe Asn 
270 

Ser Leu He Ser He 
285 

Asp Pro Lys Met Ala 
300 

He Gin Leu Ser Leu 
320 

Ser Val Glu Val Ala 
335 

Lys Ser Pro Gly He 
350 

Gin Lys Pro Leu Met 
365 
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Tyr Val Ala Glu Pro Phe Leu Pro Phe Thr Leu Gly Thr Tyr Met Leu 
370 375 380 



Val Leu Lys Asn Cys Tyr Arg Leu Leu Ala Leu Met Gin Glu Ser Met 
385 390 395 400 



<210> 27 
<211> 1140 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1137) 

<223> DOR 59D.1, a coding region on BDGP Clone No. 
AC005672 

<400> 27 

atg gca gag gtc aga gtg gac agt ctg gag ttt ttc aag age cat tgg 
Met Ala Glu Val Arg Val Asp Ser Leu Glu Phe Phe Lys Ser His Trp 
15 10 15 

acc gec tgg egg tac ttg gga gtg get cat ttt egg gtc gag aac tgg 
Thr Ala Trp Arg Tyr Leu Gly Val Ala His Phe Arg Val Glu Asn Trp 



aag aac ctt tac gtg ttt tac age att gtg teg aat ctt etc gtg acc 
Lys Asn Leu Tyr Val Phe Tyr Ser lie Val Ser Asn Leu Leu Val Thr 
35 40 45 

ctg tgc tac ccc gtt cac ctg gga ata tec etc ttt cgc aac cgc acc 
Leu Cys Tyr Pro Val His Leu Gly He Ser Leu Phe Arg Asn Arg Thr 
50 55 60 

ate acc gag gac ate etc aac ctg acc acc ttt gcg acc tgc aca gec 
He Thr Glu Asp He Leu Asn Leu Thr Thr Phe Ala Thr Cys Thr Ala 
65 70 75 80 

tgt teg gtg aag tgc ctg etc tac gee tac aac ate aag gat gtg ctg 
Cys Ser Val Lys Cys Leu Leu Tyr Ala Tyr Asn He Lys Asp Val Leu 
85 90 95 

gag atg gag egg ctg ttg agg ctt ttg gat gaa cgc gtc gtg ggt ccg 
Glu Met Glu Arg Leu Leu Arg Leu Leu Asp Glu Arg Val Val Gly Pro 
100 105 110 
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gag caa cgc age ate tac gga caa gtg agg gtc cag ctg cga aat gtg 
Glu Gin Arg Ser lie Tyr Gly Gin Val Arg Val Gin Leu Arg Asn Val 
115 120 125 



384 



eta tac gtg ttc ate ggc ate tac atg ccg tgt gee ctg ttc gec gag 
Leu Tyr Val Phe lie Gly lie Tyr Met Pro Cys Ala Leu Phe Ala Glu 
130 135 140 

eta tec ttt ctg ttc aag gag gag cgc ggt ctg atg tat ccc gee tgg 
Leu Ser Phe Leu Phe Lys Glu Glu Arg Gly Leu Met Tyr Pro Ala Trp 
145 150 155 160 

ttt ccc ttc gac tgg ctg cac tec ace agg aac tat tac ata gcg aac 
Phe Pro Phe Asp Trp Leu His Ser Thr Arg Asn Tyr Tyr lie Ala Asn 
165 170 175 

gec tat cag ata gtg ggc ate teg ttt cag ctg ctg caa aac tat gtt 
Ala Tyr Gin lie Val Gly lie Ser Phe Gin Leu Leu Gin Asn Tyr Val 
180 185 190 

age gac tgc ttt ccg gcg gtg gtg ctg tgc ctg ate tea tec cac ate 
Ser Asp Cys Phe Pro Ala Val Val Leu Cys Leu lie Ser Ser His lie 
195 200 205 

aaa atg ttg tac aac aga ttc gag gag gtg ggc ctg gat cca gec aga 
Lys Met Leu Tyr Asn Arg Phe Glu Glu Val Gly Leu Asp Pro Ala Arg 
210 215 220 

gat gcg gag aag gac ctg gag gee tgc ate acc gat cac aag cat att 
Asp Ala Glu Lys Asp Leu Glu Ala Cys lie Thr Asp His Lys His lie 
225 230 235 240 

eta gaa eta ttc cga cgc ate gag gee ttc att tec ctg ccc atg eta 
Leu Glu Leu Phe Arg Arg lie Glu Ala Phe lie Ser Leu Pro Met Leu 
245 250 255 

att cag ttc aca gtg acc gec ttg aat gtg tgc ate ggt tta gca gec 
lie Gin Phe Thr Val Thr Ala Leu Asn Val Cys lie Gly Leu Ala Ala 
260 265 270 

ctg gtg ttt ttc gtc age gag ccc atg gca egg atg tac ttc ate ttc 
Leu Val Phe Phe Val Ser Glu Pro Met Ala Arg Met Tyr Phe lie Phe 
275 280 285 

tac tec ctg gee atg ccg ctg cag ate ttt ccg tec tgc ttt ttc ggc 
Tyr Ser Leu Ala Met Pro Leu Gin lie Phe Pro Ser Cys Phe Phe Gly 
290 295 300 
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acc gac aac gag tac tgg ttc gga cgc etc cac tac gcg gec ttc agt 960 
Thr Asp Asn Glu Tyr Trp Phe Gly Arg Leu His Tyr Ala Ala Phe Ser 
305 310 315 320 

tgc aat tgg cac aca cag aac agg age ttt aag egg aaa atg atg ctg 10O8 
Cys Asn Trp His Thr Gin Asn Arg Ser Phe Lys Arg Lys Met Met Leu 
325 330 335 

ttc gtt gag caa teg ttg aag aag age acc get gtg get ggc gga atg 1056 
Phe Val Glu Gin Ser Leu Lys Lys Ser Thr Ala Val Ala Gly Gly Met 
340 345 350 

atg cgt ate cac ctg gac acg ttc ttt tec acc eta aag ggg gec tac 1104 
Met Arg lie His Leu Asp Thr Phe Phe Ser Thr Leu Lys Gly Ala Tyr 
355 360 365 



tec etc ttt acc ate att att egg atg aga aag tag 
Ser Leu Phe Thr lie lie lie Arg Met Arg Lys 
370 375 



<210> 28 
<211> 379 
<212> PRT 

<213> Drosophila melanogaster 
<400> 28 

Met Ala Glu Val Arg Val Asp Ser Leu Glu Phe Phe Lys Ser His Trp 
15 10 15 

Thr Ala Trp Arg Tyr Leu Gly Val Ala His Phe Arg Val Glu Asn Trp 



Lys Asn Leu Tyr Val Phe Tyr Ser lie Val Ser Asn Leu Leu Val Thr 
35 40 45 

Leu Cys Tyr Pro Val His Leu Gly lie Ser Leu Phe Arg Asn Arg Thr 
50 55 60 

lie Thr Glu Asp lie Leu Asn Leu Thr Thr Phe Ala Thr Cys Thr Ala 
65 70 75 80 

Cys Ser Val Lys Cys Leu Leu Tyr Ala Tyr Asn lie Lys Asp Val Leu 



Glu Met Glu Arg Leu Leu Arg Leu Leu Asp Glu Arg Val Val Gly Pro 
100 105 110 
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Glu Gin Arg Ser He Tyr Gly Gin Val Arg Val Gin Leu Arg Asn Val 
115 120 125 



Leu Tyr Val Phe lie 
130 

Leu Ser Phe Leu Phe 

145 

Phe Pro Phe Asp Trp 
165 

Ala Tyr Gin He Val 
180 

Ser Asp Cys Phe Pro 
195 

Lys Met Leu Tyr Asn 
210 

Asp Ala Glu Lys Asp 
225 

Leu Glu Leu Phe Arg 
245 

He Gin Phe Thr Val 
260 

Leu Val Phe Phe Val 
275 

Tyr Ser Leu Ala Met 
290 

Thr Asp Asn Glu Tyr 

305 

Cys Asn Trp His Thr 
325 

Phe Val Glu Gin Ser 
340 

Met Arg lie His Leu 
355 



Gly He Tyr Met Pro Cys 
135 

Lys Glu Glu Arg Gly Leu 

150 155 

Leu His Ser Thr Arg Asn 

170 

Gly He Ser Phe Gin Leu 
185 

Ala Val Val Leu Cys Leu 
200 

Arg Phe Glu Glu Val Gly 
215 

Leu Glu Ala Cys He Thr 
230 235 

Arg He Glu Ala Phe He 
250 

Thr Ala Leu Asn Val Cys 
265 

Ser Glu Pro Met Ala Arg 
280 

Pro Leu Gin He Phe Pro 
295 

Trp Phe Gly Arg Leu His 
310 315 

Gin Asn Arg Ser Phe Lys 
330 

Leu Lys Lys Ser Thr Ala 
345 

Asp Thr Phe Phe Ser Thr 
360 



Ala Leu Phe Ala Glu 
140 

Met Tyr Pro Ala Trp 

- 160 

Tyr Tyr He Ala Asn 
175 

Leu Gin Asn Tyr Val 
190 

He Ser Ser His He 
205 

Leu Asp Pro Ala Arg 
220 

Asp His Lys His He 
240 

Ser Leu Pro Met Leu 
255 

He Gly Leu Ala Ala 
270 

Met Tyr Phe He Phe 
285 

Ser Cys Phe Phe Gly 
300 

Tyr Ala Ala Phe Ser 
320 

Arg Lys Met Met Leu 
335 

Val Ala Gly Gly Met 
350 

Leu Lys Gly Ala Tyr 
365 



55 



Ser Leu Phe Thr lie lie lie Arg Met Arg Lys 
370 375 



<210> 29 
<211> 1194 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1194) 

<223> DOR 2F.1, coding region of NCBI Accession No. 
AL009195 

<400> 29 

atg gag aag caa gag gat ttc aaa ctg aac acc cac agt get gtg tac 

Met Glu Lys Gin Glu Asp Phe Lys Leu Asn Thr His Ser Ala Val Tyr 
15 10 15 

tac cac tgg cgc gtt tgg gag etc act ggc ctg atg cgt cct ccg ggc 
Tyr His Trp Arg Val Trp Glu Leu Thr Gly Leu Met Arg Pro Pro Gly 



gtt tea age ctg ctt tac gtg gta tac tec att acg gtc aac ttg gtg 
Val Ser Ser Leu Leu Tyr Val Val Tyr Ser He Thr Val Asn Leu Val 



gtc acc gtg ctg ttt ccc ttg age ttg ctg gec agg ctg ctg ttc acc 
Val Thr Val Leu Phe Pro Leu Ser Leu Leu Ala Arg Leu Leu Phe Thr 



acc aac atg gec gga ttg tgc gag aac ctg acc ata act att acc gat 
Thr Asn Met Ala Gly Leu Cys Glu Asn Leu Thr He Thr He Thr Asp 



att gtg gec aat ttg aag ttt gcg aat gtg tac atg gtg agg aag cag 
He Val Ala Asn Leu Lys Phe Ala Asn Val Tyr Met Val Arg Lys Gin 



etc cat gag att cgc tct etc eta agg etc atg gac get aga gec egg 
Leu His Glu He Arg Ser Leu Leu Arg Leu Met Asp Ala Arg Ala Arg 
100 105 110 

ctg gtg ggc gat ccc gag gag att tct gec ttg agg aag gaa gtg aat 
Leu Val Gly Asp Pro Glu Glu He Ser Ala Leu Arg Lys Glu Val Asn 



56 



115 



120 



125 



ate gca cag ggc act ttc cgc acc ttt gec agt att ttc gta ttt ggc 432 
lie Ala Gin Gly Thr Phe Arg Thr Phe Ala Ser He Phe Val Phe Gly 
130 135 140 



act act ttg agt tgc gtc cgc gtg gtc gtt cgc cca gat cga gag etc 480 
Thr Thr Leu Ser Cys Val Arg Val Val Val Arg Pro Asp Arg Glu Leu 
145 150 155 160 



ctg tat ccg gec tgg ttc ggc gtt gac tgg atg cac tec acc aga aac 
Leu Tyr Pro Ala Trp Phe Gly Val Asp Trp Met His Ser Thr Arg Asn 
165 170 175 



tat gtg etc ate aat ate tac cag etc ttc ggc ttg ata gtg cag get 57 6 
Tyr Val Leu He Asn He Tyr Gin Leu Phe Gly Leu He Val Gin Ala 
180 185 190 



ata cag aac tgc get agt gac tec tat ccg cct gcg ttt etc tgc ctg 624 
He Gin Asn Cys Ala Ser Asp Ser Tyr Pro Pro Ala Phe Leu Cys Leu 
195 200 205 



etc acg ggt cat atg cgt get ttg gag ctg agg gtg egg egg att ggc 672 
Leu Thr Gly His Met Arg Ala Leu Glu Leu Arg Val Arg Arg He Gly 
210 215 220 



tgc agg acg gaa aag tec aat aaa ggg cag aca tat gaa gec tgg egg 720 
Cys Arg Thr Glu Lys Ser Asn Lys Gly Gin Thr Tyr Glu Ala Trp Arg 
225 230 235 240 



gag gag gtg tac cag gaa etc ate gag tgc ate cgc gat ctg gcg egg 7 68 
Glu Glu Val Tyr Gin Glu Leu He Glu Cys He Arg Asp Leu Ala Arg 
245 250 255 



gtc cat egg ctg agg gag ate att cag egg gtc ctt tea gtg ccc tgc 816 
Val His Arg Leu Arg Glu He He Gin Arg Val Leu Ser Val Pro Cys 
260 265 270 



atg gee cag ttc gtc tgc tec gee gee gtc cag tgt acc gtc gec atg 
Met Ala Gin Phe Val Cys Ser Ala Ala Val Gin Cys Thr Val Ala Met 
275 280 285 



cac ttc ctg tac gta gcg gat gac cac gac cac acc gec atg ate ate 912 
His Phe Leu Tyr Val Ala Asp Asp His Asp His Thr Ala Met He He 
290 295 300 



teg att gta ttt ttc teg gec gtc acc ttg gag gtg ttt gta ate tgc 960 
Ser He Val Phe Phe Ser Ala Val Thr Leu Glu Val Phe Val He Cys 
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305 



310 



315 



320 



tat ttt ggg gac agg atg egg aca cag age gag gcg ctg tgc gat gec 
Tyr Phe Gly Asp Arg Met Arg Thr Gin Ser Glu Ala Leu Cys Asp Ala 
325 330 335 

ttc tac gat tgc aac tgg ata gaa cag ctg ccc aag ttc aag cgc gaa 
Phe Tyr Asp Cys Asn Trp lie Glu Gin Leu Pro Lys Phe Lys Arg Glu 
340 345 350 

ctg etc ttc acc ctg gee agg acg cag egg cct tct ctt ate tac gca 
Leu Leu Phe Thr Leu Ala Arg Thr Gin Arg Pro Ser Leu lie Tyr Ala 
355 360 365 

ggc aac tac ate gca etc teg ctg gag acc ttc gag cag cag gtc atg 
Gly Asn Tyr He Ala Leu Ser Leu Glu Thr Phe Glu Gin Gin Val Met 
370 375 380 



agg ttc aca tac tct gtt ttc aca etc ttg ctg agg gec aag 
Arg Phe Thr Tyr Ser Val Phe Thr Leu Leu Leu Arg Ala Lys 
385 390 395 



<210> 30 
<211> 398 
<212> PRT 

<213> Drosophila melanogaster 
<400> 30 

Met Glu Lys Gin Glu Asp Phe Lys Leu Asn Thr His Ser Ala Val Tyr 
15 10 15 

Tyr His Trp Arg Val Trp Glu Leu Thr Gly Leu Met Arg Pro Pro Gly 
20 25 30 

Val Ser Ser Leu Leu Tyr Val Val Tyr Ser He Thr Val Asn Leu Val 
35 40 45 

Val Thr Val Leu Phe Pro Leu Ser Leu Leu Ala Arg Leu Leu Phe Thr 
50 55 60 

Thr Asn Met Ala Gly Leu Cys Glu Asn Leu Thr He Thr He Thr Asp 
65 70 75 80 

He Val Ala Asn Leu Lys Phe Ala Asn Val Tyr Met Val Arg Lys Gin 
85 90 95 

Leu His Glu He Arg Ser Leu Leu Arg Leu Met Asp Ala Arg Ala Arg 
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100 



105 



110 



Leu Val Gly Asp 
115 

He Ala Gin Gly 
130 

Thr Thr Leu Ser 
145 

Leu Tyr Pro Ala 



Tyr Val Leu He 
180 

He Gin Asn Cys 
195 

Leu Thr Gly His 
210 

Cys Arg Thr Glu 
225 

Glu Glu Val Tyr 



Val His Arg Leu 
260 

Met Ala Gin Phe 
275 

His Phe Leu Tyr 
290 

Ser He Val Phe 
305 

Tyr Phe Gly Asp 



Phe Tyr Asp Cys 
340 

Leu Leu Phe Thr 



Pro Glu Glu He 
120 

Thr Phe Arg Thr 
135 

Cys Val Arg Val 
150 

Trp Phe Gly Val 
165 

Asn lie Tyr Gin 



Ala Ser Asp Ser 
200 

Met Arg Ala Leu 
215 

Lys Ser Asn Lys 
230 

Gin Glu Leu He 
245 

Arg Glu He He 



Val Cys Ser Ala 
280 

Val Ala Asp Asp 
295 

Phe Ser Ala Val 
310 

Arg Met Arg Thr 
325 

Asn Trp He Glu 



Leu Ala Arg Thr 



Ser Ala Leu Arg 



Phe Ala Ser He 
140 

Val Val Arg Pro 
155 

Asp Trp Met His 

170 

Leu Phe Gly Leu 
185 

Tyr Pro Pro Ala 



Glu Leu Arg Val 
220 

Gly Gin Thr Tyr 
235 

Glu Cys He Arg 
250 

Gin Arg Val Leu 
265 

Ala Val Gin Cys 



His Asp His Thr 
300 

Thr Leu Glu Val 
315 

Gin Ser Glu Ala 
330 

Gin Leu Pro Lys 
345 

Gin Arg Pro Ser 



Lys Glu Val Asn 
125 

Phe Val Phe Gly 



Asp Arg Glu Leu 
160 

Ser Thr Arg Asn 
175 

He Val Gin Ala 
190 

Phe Leu Cys Leu 
205 

Arg Arg He Gly 



Glu Ala Trp Arg 
240 

Asp Leu Ala Arg 
255 

Ser Val Pro Cys 
270 

Thr Val Ala Met 
285 

Ala Met He He 



Phe Val He Cys 
320 

Leu Cys Asp Ala 
335 

Phe Lys Arg Glu 
350 

Leu He Tyr Ala 
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355 



360 



365 



Gly Asn Tyr lie Ala Leu Ser Leu Glu Thr Phe Glu Gin Gin Val Met 
370 375 380 

Arg Phe Thr Tyr Ser Val Phe Thr Leu Leu Leu Arg Ala Lys 
385 390 395 



<210> 31 
<211> 1191 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1191) 

<223> DOR 22A.1, a coding region of BDGP Clone No. 
AC004121 



atg tta age aag ttt ttt ccc cac ata aaa gaa aag cca ttg age gag 
Met Leu Ser Lys Phe Phe Pro His lie Lys Glu Lys Pro Leu Ser Glu 



egg gtt aag tec cga gat gec ttc att tac ttg gat egg gtg atg tgg 
Arg Val Lys Ser Arg Asp Ala Phe lie Tyr Leu Asp Arg Val Met Trp 
20 25 30 

tec ttt ggc tgg aca gag cct gaa aac aaa agg tgg ate ctt cct tat 
Ser Phe Gly Trp Thr Glu Pro Glu Asn Lys Arg Trp lie Leu Pro Tyr 



aaa ctg tgg tta gcg ttc gtg aac ata gta atg etc ate ctt ctg ccg 
Lys Leu Trp Leu Ala Phe Val Asn lie Val Met Leu lie Leu Leu Pro 



ate teg ata age ate gag tac etc cac cga ttt aaa acc ttc teg gcg 
lie Ser lie Ser lie Glu Tyr Leu His Arg Phe Lys Thr Phe Ser Ala 



ggg gag ttc ctt agt tec etc gag att gga gtc aac atg tac gga age 
Gly Glu Phe Leu Ser Ser Leu Glu lie Gly Val Asn Met Tyr Gly Ser 



tct ttt aag tgc gec ttc acc ttg att gga ttc aag aaa aga cag gaa 
Ser Phe Lys Cys Ala Phe Thr Leu lie Gly Phe Lys Lys Arg Gin Glu 



60 



100 



105 



110 



get aag gtt tta ctg gat cag ctg gac aag aga tgc ctt age gat aag 
Ala Lys Val Leu Leu Asp Gin Leu Asp Lys Arg Cys Leu Ser Asp Lys 
115 120 125 

gag agg tec act gtt cat cgc tat gtc gec atg gga aac ttt ttc gat 
Glu Arg Ser Thr Val His Arg Tyr Val Ala Met Gly Asn Phe Phe Asp 
130 135 140 

att ttg tat cac att ttt tac tec acc ttc gtg gta atg aac ttc ccg 
lie Leu Tyr His lie Phe Tyr Ser Thr Phe Val Val Met Asn Phe Pro 
145 150 155 160 

tat ttt ctg ctt gag aga cgc cat get tgg cgc atg tac ttt cca tat 
Tyr Phe Leu Leu Glu Arg Arg His Ala Trp Arg Met Tyr Phe Pro Tyr 
165 170 175 

ate gat tec gac gaa cag ttt tac ate tec age ate gee gag tgt ttt 
lie Asp Ser Asp Glu Gin Phe Tyr lie Ser Ser lie Ala Glu Cys Phe 
180 185 190 

ctg atg acg gag gec ate tac atg gat etc tgt acg gac gtg tgt ccc 
Leu Met Thr Glu Ala lie Tyr Met Asp Leu Cys Thr Asp Val Cys Pro 
195 200 205 

ttg ate tec atg ctt atg get cga tgc cac att age etc ctg aaa cag 
Leu lie Ser Met Leu Met Ala Arg Cys His lie Ser Leu Leu Lys Gin 
210 215 220 

cga ctg aga aat etc cga teg aag cca gga agg acc gaa gat gag tac 
Arg Leu Arg Asn Leu Arg Ser Lys Pro Gly Arg Thr Glu Asp Glu Tyr 
225 230 235 240 

ttg gag gag etc acc gag tgc att egg gat cat cga ttg eta ttg gac 
Leu Glu Glu Leu Thr Glu Cys lie Arg Asp His Arg Leu Leu Leu Asp 
245 250 255 

tat gtt gac gca ttg cga ccc gtc ttt teg gga acc att ttt gtg cag 
Tyr Val Asp Ala Leu Arg Pro Val Phe Ser Gly Thr lie Phe Val Gin 
260 265 270 

ttc etc ctg ate ggt act gta ctg ggt etc tea atg ata aat eta atg 
Phe Leu Leu lie Gly Thr Val Leu Gly Leu Ser Met lie Asn Leu Met 
275 280 285 

ttc ttc teg aca ttt tgg act ggt gtc gec act tgc ctt ttt atg ttc 
Phe Phe Ser Thr Phe Trp Thr Gly Val Ala Thr Cys Leu Phe Met Phe 
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gac gtg tec atg gag acg ttc ccc ttt tgc tat ttg tgc aac atg att 960 

Asp Val Ser Met Glu Thr Phe Pro Phe Cys Tyr Leu Cys Asn Met lie 

305 310 315 320 

ate gat gac tgc cag gaa atg tec aat tgc etc ttt caa teg gac tgg 1008 

lie Asp Asp Cys Gin Glu Met Ser Asn Cys Leu Phe Gin Ser Asp Trp 
325 330 335 

acc tct gec gat cgt cgc tac aaa tec acg ttg gta tac ttt ctt cac 1056 

Thr Ser Ala Asp Arg Arg Tyr Lys Ser Thr Leu Val Tyr Phe Leu His 
340 345 350 

aat ctt cag caa ccc att act etc acg get ggt gga gtg ttt cct att 1104 

Asn Leu Gin Gin Pro lie Thr Leu Thr Ala Gly Gly Val Phe Pro lie 

355 360 365 

tec atg caa aca aat ttg get atg gtg aag ctg gca ttt tct gtg gtt 1152 

Ser Met Gin Thr Asn Leu Ala Met Val Lys Leu Ala Phe Ser Val Val 

370 375 380 

acg gta att aag caa ttt aac ttg gec gaa agg ttt caa 1191 

Thr Val He Lys Gin Phe Asn Leu Ala Glu Arg Phe Gin 

385 390 395 



<210> 32 
<211> 397 
<212> PRT 

<213> Drosophila melanogaster 
<400> 32 

Met Leu Ser Lys Phe Phe Pro His He Lys Glu Lys Pro Leu Ser Glu 



Arg Val Lys Ser Arg Asp Ala Phe He Tyr Leu Asp Arg Val Met Trp 



Ser Phe Gly Trp Thr Glu Pro Glu Asn Lys Arg Trp He Leu Pro Tyr 



Lys Leu Trp Leu Ala Phe Val Asn He Val Met Leu He Leu Leu Pro 
50 55 60 

He Ser He Ser He Glu Tyr Leu His Arg Phe Lys Thr Phe Ser Ala 



62 



Gly Glu Phe Leu Ser Ser Leu Glu lie Gly Val Asn Met Tyr Gly Ser 
85 90 95 



Ser Phe Lys Cys Ala 
100 

Ala Lys Val Leu Leu 
115 

Glu Arg Ser Thr Val 
130 

lie Leu Tyr His lie 
145 

Tyr Phe Leu Leu Glu 
165 

lie Asp Ser Asp Glu 
180 

Leu Met Thr Glu Ala 
195 

Leu lie Ser Met Leu 
210 

Arg Leu Arg Asn Leu 
225 

Leu Glu Glu Leu Thr 
245 

Tyr Val Asp Ala Leu 
260 

Phe Leu Leu lie Gly 
275 

Phe Phe Ser Thr Phe 
290 

Asp Val Ser Met Glu 
305 

lie Asp Asp Cys Gin 
325 



Phe Thr Leu lie Gly Phe 
105 

Asp Gin Leu Asp Lys Arg 
120 

His Arg Tyr Val Ala Met 
135 

Phe Tyr Ser Thr Phe Val 
150 155 

Arg Arg His Ala Trp Arg 
170 

Gin Phe Tyr lie Ser Ser 
185 

lie Tyr Met Asp Leu Cys 
200 

Met Ala Arg Cys His lie 
215 

Arg Ser Lys Pro Gly Arg 
230 235 

Glu Cys lie Arg Asp His 
250 

Arg Pro Val Phe Ser Gly 
265 

Thr Val Leu Gly Leu Ser 
280 

Trp Thr Gly Val Ala Thr 
295 

Thr Phe Pro Phe Cys Tyr 
310 315 

Glu Met Ser Asn Cys Leu 
330 



Lys Lys Arg Gin Glu 
110 

Cys Leu Ser Asp Lys 
125 

Gly Asn Phe Phe Asp 
140 

Val Met Asn Phe Pro 
160 

Met Tyr Phe Pro Tyr 
175 

lie Ala Glu Cys Phe 
190 

Thr Asp Val Cys Pro 
205 

Ser Leu Leu Lys Gin 
220 

Thr Glu Asp Glu Tyr 
240 

Arg Leu Leu Leu Asp 
255 

Thr lie Phe Val Gin 
270 

Met lie Asn Leu Met 
285 

Cys Leu Phe Met Phe 
300 

Leu Cys Asn Met lie 
320 

Phe Gin Ser Asp Trp 
335 



63 



Thr Ser Ala Asp Arg Arg Tyr Lys Ser Thr Leu Val Tyr Phe Leu His 
340 345 350 



Asn Leu Gin Gin Pro He Thr Leu Thr Ala Gly Gly Val Phe Pro He 

355 360 365 

Ser Met Gin Thr Asn Leu Ala Met Val Lys Leu Ala Phe Ser Val Val 

370 375 380 

Thr Val He Lys Gin Phe Asn Leu Ala Glu Arg Phe Gin 

385 390 395 



<210> 33 
<211> 1200 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1200) 

<223> DOR 36E. 1 

<400> 33 

atg gtt cgt tac gtg ccc egg ttc get gat ggt cag aaa gta aag ttg 4 8 

Met Val Arg Tyr Val Pro Arg Phe Ala Asp Gly Gin Lys Val Lys Leu 

15 10 15 

get tgg ccc ttg gcg gtt ttt egg tta aat cac ata ttc tgg cca ttg 96 
Ala Trp Pro Leu Ala Val Phe Arg Leu Asn His He Phe Trp Pro Leu 
20 25 30 

gat ccg age aca ggg aaa tgg ggc cga tat ctg gac aag gtt eta get 14 4 
Asp Pro Ser Thr Gly Lys Trp Gly Arg Tyr Leu Asp Lys Val Leu Ala 



gtt gcg atg tec ttg gtt ttt atg caa cac aac gat gca gag ctg agg 192 
Val Ala Met Ser Leu Val Phe Met Gin His Asn Asp Ala Glu Leu Arg 



tac ttg cgc ttc gag gca agt aat egg aat ttg gat gec ttt etc aca 240 
Tyr Leu Arg Phe Glu Ala Ser Asn Arg Asn Leu Asp Ala Phe Leu Thr 
65 70 75 80 

gga atg cca acg tat tta ate etc gtg gag get caa ttt aga agt ctt 288 
Gly Met Pro Thr Tyr Leu He Leu Val Glu Ala Gin Phe Arg Ser Leu 
85 90 95 

64 



cac att eta ctg cac ttc gag aag ctt cag aag ttt tta gaa ata ttc 
His lie Leu Leu His Phe Glu Lys Leu Gin Lys Phe Leu Glu lie Phe 
100 105 110 

tac gca aat att tat att gat ccc cgt aag gaa ccc gaa atg ttt cga 
Tyr Ala Asn lie Tyr lie Asp Pro Arg Lys Glu Pro Glu Met Phe Arg 
115 120 125 

aaa gtg gat gga aag atg ata att aac aga tta gtt teg gec atg tac 
Lys Val Asp Gly Lys Met lie lie Asn Arg Leu Val Ser Ala Met Tyr 
130 135 140 

ggt gca gtt ate tct ctg tat eta ate gca ccc gtt ttt tec ate att 
Gly Ala Val He Ser Leu Tyr Leu He Ala Pro Val Phe Ser He He 
145 150 155 160 

aac caa age aaa gat ttt eta tac tct atg ate ttt ccg ttc gat teg 
Asn Gin Ser Lys Asp Phe Leu Tyr Ser Met He Phe Pro Phe Asp Ser 
165 170 175 

gat ccc ttg tac ata ttt gtg cca ctg ctt ttg aca aac gta tgg gtt 
Asp Pro Leu Tyr He Phe Val Pro Leu Leu Leu Thr Asn Val Trp Val 
180 185 190 

ggc att gta ata gat acc atg atg ttc ggg gag acg aat ttg ttg tgt 
Gly He Val He Asp Thr Met Met Phe Gly Glu Thr Asn Leu Leu Cys 
195 200 205 

gaa eta att gtc cac eta aat ggt agt tat atg ttg etc aag agg gac 
Glu Leu lie Val His Leu Asn Gly Ser Tyr Met Leu Leu Lys Arg Asp 
210 215 220 

ttg cag ttg gec att gaa aag ata tta gtt gca agg gac cgt ccg cat 
Leu Gin Leu Ala He Glu Lys He Leu Val Ala Arg Asp Arg Pro His 
225 230 235 240 

atg gec aaa cag eta aag gtt tta att aca aaa act etc cga aag aat 
Met Ala Lys Gin Leu Lys Val Leu He Thr Lys Thr Leu Arg Lys Asn 
245 250 255 

gtg get eta aat cag ttt ggc cag cag ctg gag get cag tat act gtg 
Val Ala Leu Asn Gin Phe Gly Gin Gin Leu Glu Ala Gin Tyr Thr Val 
260 265 270 

egg gtt ttt att atg ttt gca ttc get gcg ggc ctt tta tgt get ctt 
Arg Val Phe He Met Phe Ala Phe Ala Ala Gly Leu Leu Cys Ala Leu 
275 280 285 



65 



tct ttt aag get tat acg acg gat tec etc age aca atg tac tac ctt 
Ser Phe Lys Ala Tyr Thr Thr Asp Ser Leu Ser Thr Met Tyr Tyr Leu 
290 295 300 



912 



ace cat tgg gag caa ate ctg cag tac 
Thr His Trp Glu Gin lie Leu Gin Tyr 
305 310 

ctg cga tta eta aag etc att aac ttg 
Leu Arg Leu Leu Lys Leu lie Asn Leu 
325 

ccc ttc tat gtg aca ggg eta aaa tat 
Pro Phe Tyr Val Thr Gly Leu Lys Tyr 
340 345 

ggc tta aaa gta agt gaa aaa cga gtg 
Gly Leu Lys Val Ser Glu Lys Arg Val 
355 360 

tct ttc aca gat tct gca ggc ate ctt 
Ser Phe Thr Asp Ser Ala Gly lie Leu 
370 375 

ttc gat gca gcg acg aca aat gag caa 
Phe Asp Ala Ala Thr Thr Asn Glu Gin 
385 390 



tct aca aat ccc age gaa aat 960 
Ser Thr Asn Pro Ser Glu Asn 
315 320 

gee att gag atg aac age aag 1008 
Ala lie Glu Met Asn Ser Lys 
330 335 

ttt cgc gtt agt ctg cag get 1056 

Phe Arg Val Ser Leu Gin Ala 
350 

caa aac cat ttc act gtc age 1104 

Gin Asn His Phe Thr Val Ser 
365 

etc gta ctt cac att cct cac 1152 

Leu Val Leu His lie Pro His 
380 

tta aat aat tea cat ttt ttt 1200 

Leu Asn Asn Ser His Phe Phe 

395 400 



<210> 34 
<211> 400 
<212> PRT 

<213> Drosophila melanogaster 
<400> 34 

Met Val Arg Tyr Val Pro Arg Phe Ala Asp Gly Gin Lys Val Lys Leu 
15 10 15 

Ala Trp Pro Leu Ala Val Phe Arg Leu Asn His lie Phe Trp Pro Leu 
20 25 30 

Asp Pro Ser Thr Gly Lys Trp Gly Arg Tyr Leu Asp Lys Val Leu Ala 
35 40 45 

Val Ala Met Ser Leu Val Phe Met Gin His Asn Asp Ala Glu Leu Arg 

50 55 60 



66 



Tyr Leu Arg Phe Glu Ala Ser Asn Arg Asn Leu Asp Ala Phe Leu Thr 
65 70 75 80 



Gly Met Pro Thr Tyr Leu 
85 

His lie Leu Leu His Phe 
100 

Tyr Ala Asn lie Tyr 
115 

Lys Val Asp Gly Lys 
130 

Gly Ala Val He Ser 

145 

Asn Gin Ser Lys Asp 
165 

Asp Pro Leu Tyr He 
180 

Gly He Val He Asp 
195 

Glu Leu He Val His 
210 

Leu Gin Leu Ala He 
225 

Met Ala Lys Gin Leu 
245 

Val Ala Leu Asn Gin 
260 

Arg Val Phe He Met 
275 

Ser Phe Lys Ala Tyr 
290 

Thr His Trp Glu Gin 

305 



He Leu Val Glu Ala 
90 

Glu Lys Leu Gin Lys 

105 

He Asp Pro Arg Lys Glu 
120 

Met He He Asn Arg Leu 
135 

Leu Tyr Leu He Ala Pro 
150 155 

Phe Leu Tyr Ser Met He 
170 

Phe Val Pro Leu Leu Leu 
185 

Thr Met Met Phe Gly Glu 
200 

Leu Asn Gly Ser Tyr Met 
215 

Glu Lys He Leu Val Ala 
230 235 

Lys Val Leu He Thr Lys 

250 

Phe Gly Gin Gin Leu Glu 
265 

Phe Ala Phe Ala Ala Gly 
280 

Thr Thr Asp Ser Leu Ser 
295 

He Leu Gin Tyr Ser Thr 
310 315 



Gin Phe Arg Ser Leu 
95 

Phe Leu Glu He Phe 
110 

Pro Glu Met Phe Arg 
125 

Val Ser Ala Met Tyr 
140 

Val Phe Ser He He 
160 

Phe Pro Phe Asp Ser 
175 

Thr Asn Val Trp Val 
190 

Thr Asn Leu Leu Cys 
205 

Leu Leu Lys Arg Asp 
220 

Arg Asp Arg Pro His 
240 

Thr Leu Arg Lys Asn 
255 

Ala Gin Tyr Thr Val 
270 

Leu Leu Cys Ala Leu 
285 

Thr Met Tyr Tyr Leu 

300 

Asn Pro Ser Glu Asn 
320 



67 



Leu Arg Leu Leu 



Pro Phe Tyr Val 
340 

Gly Leu Lys Val 
355 

Ser Phe Thr Asp 
370 

Phe Asp Ala Ala 
385 



Lys Leu lie Asn 
325 

Thr Gly Leu Lys 



Ser Glu Lys Arg 
360 

Ser Ala Gly He 
375 

Thr Thr Asn Glu 
390 



Leu Ala He Glu 
330 

Tyr Phe Arg Val 
345 

Val Gin Asn His 



Leu Leu Val Leu 
380 

Gin Leu Asn Asn 
395 



Met Asn Ser Lys 
335 

Ser Leu Gin Ala 
350 

Phe Thr Val Ser 
365 

His He Pro His 



Ser His Phe Phe 
400 



<210> 35 
<211> 1197 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1197) 
<223> DOR 41E.1 

<400> 35 

atg gtc ttc gag eta ata cgt ccc get 
Met Val Phe Glu Leu He Arg Pro Ala 
1 5 

tec cga gat ggt tgc ate tac ctt tac 
Ser Arg Asp Gly Cys He Tyr Leu Tyr 
20 25 

tgg ctg ccc ccc aag cag ggt gtg etc 
Trp Leu Pro Pro Lys Gin Gly Val Leu 
35 40 

acg eta atg acg ttc gtg tgg tgt aca 
Thr Leu Met Thr Phe Val Trp Cys Thr 
50 55 



ccg etc acg gag cag aag egg 48 
Pro Leu Thr Glu Gin Lys Arg 
10 15 

cgc gee atg aag ttt att gga 96 
Arg Ala Met Lys Phe He Gly 
30 

egg tat gtg tac etc acc tgg 144 
Arg Tyr Val Tyr Leu Thr Trp 
45 

acg tac ctg ccg ctt ggc ttc 192 
Thr Tyr Leu Pro Leu Gly Phe 
60 



ctt ggt age tac 
Leu Gly Ser Tyr 

65 



atg acg cag ate aag tec ttc 
Met Thr Gin He Lys Ser Phe 
70 75 



tec cct gga gag ttt 
Ser Pro Gly Glu Phe 
80 



240 



etc act tea etc cag gtg tgc att aat gec tac ggc tea teg gta aaa 
Leu Thr Ser Leu Gin Val Cys lie Asn Ala Tyr Gly Ser Ser Val Lys 
85 90 95 

gtt gca ate aca tac tec atg etc tgg cgc ctt ate aag gec aag aac 
Val Ala lie Thr Tyr Ser Met Leu Trp Arg Leu lie Lys Ala Lys Asn 
100 105 110 

att ttg gac cag ctg gac ctg cgc tgc acc gee atg gag gag cgc gaa 
lie Leu Asp Gin Leu Asp Leu Arg Cys Thr Ala Met Glu Glu Arg Glu 
115 120 125 

aag ate cac eta gtg gtg gec cgc age aac cat gec ttt etc ate ttc 
Lys lie His Leu Val Val Ala Arg Ser Asn His Ala Phe Leu lie Phe 
130 135 140 

acc ttt gtc tac tgc gga tat gee ggc tec acc tac ctg age teg gtt 
Thr Phe Val Tyr Cys Gly Tyr Ala Gly Ser Thr Tyr Leu Ser Ser Val 
145 150 155 160 

etc age ggg cgt ccg ccc tgg cag ctg tac aat ccc ttt att gat tgg 
Leu Ser Gly Arg Pro Pro Trp Gin Leu Tyr Asn Pro Phe He Asp Trp 
165 170 175 

cat gac ggc aca etc aag etc tgg gtg gec tec acg ttg gag tac atg 
His Asp Gly Thr Leu Lys Leu Trp Val Ala Ser Thr Leu Glu Tyr Met 
180 185 190 

gtg atg tea ggc gee gtt ctg cag gat caa etc teg gac tct tac cca 
Val Met Ser Gly Ala Val Leu Gin Asp Gin Leu Ser Asp Ser Tyr Pro 
195 200 205 

ttg ate tat acc etc ate ctt cgt get cac ttg gac atg eta agg gag 
Leu He Tyr Thr Leu He Leu Arg Ala His Leu Asp Met Leu Arg Glu 
210 215 220 

cgc ate cga cgc etc cgt tec gat gag aac ctg age gag gec gag age 
Arg He Arg Arg Leu Arg Ser Asp Glu Asn Leu Ser Glu Ala Glu Ser 
225 230 235 240 

tat gaa gag ctg gtc aaa tgt gtg atg gac cac aag etc att eta aga 
Tyr Glu Glu Leu Val Lys Cys Val Met Asp His Lys Leu He Leu Arg 
245 250 255 

tac tgc gcg att att aaa cca gta ate cag ggg acc ate ttc aca cag 
Tyr Cys Ala He He Lys Pro Val He Gin Gly Thr He Phe Thr Gin 
260 265 270 



69 



ttt ctg ctg ate ggc ctg gtt ctg ggc ttc acg ctg ate aac gtg ttt 864 
Phe Leu Leu lie Gly Leu Val Leu Gly Phe Thr Leu lie Asn Val Phe 
275 280 285 

ttc ttc tea gac ate tgg acg ggc ate gca tea ttt atg ttt gtt ata 912 
Phe Phe Ser Asp lie Trp Thr Gly He Ala Ser Phe Met Phe Val He 
290 295 300 

acc att ttg ctg cag acc ttc ccc ttc tgc tac aca tgc aac etc ate 960 
Thr He Leu Leu Gin Thr Phe Pro Phe Cys Tyr Thr Cys Asn Leu He 
305 310 315 320 

atg gag gac tgc gag tec ttg acc cat get att ttc cag tec aac tgg 1008 
Met Glu Asp Cys Glu Ser Leu Thr His Ala He Phe Gin Ser Asn Trp 
325 330 335 

gtg gat gec agt cgt cgc tac aaa aca aca eta ctg tat ttt etc caa 1056 
Val Asp Ala Ser Arg Arg Tyr Lys Thr Thr Leu Leu Tyr Phe Leu Gin 
340 345 350 

aac gtg cag cag cct ate gtt ttc att gca ggc ggt ate ttt cag ata 1104 
Asn Val Gin Gin Pro He Val Phe He Ala Gly Gly He Phe Gin He 
355 360 365 

tec atg age age aac ata agt gtg gca aag ttt get ttc tec gtg ata 1152 
Ser Met Ser Ser Asn He Ser Val Ala Lys Phe Ala Phe Ser Val He 
370 375 380 

acc att acc aag caa atg aat ata get gac aaa ttt aag acg gac 1197 
Thr He Thr Lys Gin Met Asn He Ala Asp Lys Phe Lys Thr Asp 
385 390 395 



<210> 36 
<211> 399 
<212> PRT 

<213> Drosophila melanogaster 

<400> 36 

Met Val Phe Glu Leu He Arg Pro Ala Pro Leu Thr Glu Gin Lys Arg 



Ser Arg Asp Gly Cys He Tyr Leu Tyr Arg Ala Met Lys Phe He Gly 
20 25 30 

Trp Leu Pro Pro Lys Gin Gly Val Leu Arg Tyr Val Tyr Leu Thr Trp 
35 40 45 



70 



Thr Leu Met Thr Phe Val Trp Cys Thr Thr Tyr Leu Pro Leu Gly Phe 
50 55 60 



Leu Gly Ser Tyr Met Thr Gin He Lys Ser Phe Ser Pro Gly Glu Phe 
65 70 75 80 

Leu Thr Ser Leu Gin Val Cys He Asn Ala Tyr Gly Ser Ser Val Lys 
85 90 95 

Val Ala He Thr Tyr Ser Met Leu Trp Arg Leu He Lys Ala Lys Asn 
100 105 110 

He Leu Asp Gin Leu Asp Leu Arg Cys Thr Ala Met Glu Glu Arg Glu 
115 120 125 

Lys He His Leu Val Val Ala Arg Ser Asn His Ala Phe Leu He Phe 
130 135 140 

Thr Phe Val Tyr Cys Gly Tyr Ala Gly Ser Thr Tyr Leu Ser Ser Val 
145 150 155 160 

Leu Ser Gly Arg Pro Pro Trp Gin Leu Tyr Asn Pro Phe He Asp Trp 
165 170 175 

His Asp Gly Thr Leu Lys Leu Trp Val Ala Ser Thr Leu Glu Tyr Met 
180 185 190 

Val Met Ser Gly Ala Val Leu Gin Asp Gin Leu Ser Asp Ser Tyr Pro 
195 200 205 

Leu lie Tyr Thr Leu He Leu Arg Ala His Leu Asp Met Leu Arg Glu 
210 215 220 

Arg He Arg Arg Leu Arg Ser Asp Glu Asn Leu Ser Glu Ala Glu Ser 
225 230 235 240 

Tyr Glu Glu Leu Val Lys Cys Val Met Asp His Lys Leu He Leu Arg 
245 250 255 

Tyr Cys Ala He He Lys Pro Val He Gin Gly Thr He Phe Thr Gin 
260 265 270 

Phe Leu Leu He Gly Leu Val Leu Gly Phe Thr Leu He Asn Val Phe 
275 280 285 

Phe Phe Ser Asp He Trp Thr Gly He Ala Ser Phe Met Phe Val He 
290 295 300 



71 



Thr lie Leu Leu 
305 

Met Glu Asp Cys 



Val Asp Ala Ser 
340 

Asn Val Gin Gin 
355 

Ser Met Ser Ser 
370 

Thr lie Thr Lys 
385 



Gin Thr Phe Pro 

310 

Glu Ser Leu Thr 
325 

Arg Arg Tyr Lys 



Pro lie Val Phe 
360 

Asn lie Ser Val 
375 

Gin Met Asn He 
390 



Phe Cys Tyr Thr 
315 

His Ala He Phe 
330 

Thr Thr Leu Leu 
345 

He Ala Gly Gly 



Ala Lys Phe Ala 
380 

Ala Asp Lys Phe 
395 



Cys Asn Leu He 

320 

Gin Ser Asn Trp 
335 

Tyr Phe Leu Gin 
350 

He Phe Gin He 
365 

Phe Ser Val He 



Lys Thr Asp 



<210> 37 
<211> 1218 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1218) 
<223> DOR 41E.2 

<400> 37 

atg gat ctg cga agg tgg ttt ccg acc ttg tac acc cag teg aag gat 
Met Asp Leu Arg Arg Trp Phe Pro Thr Leu Tyr Thr Gin Ser Lys Asp 
15 10 15 

teg cca gtt cgc tec cga gac gcg acc ctg tac etc eta cgc tgc gtc 
Ser Pro Val Arg Ser Arg Asp Ala Thr Leu Tyr Leu Leu Arg Cys Val 



ttc tta atg ggc gtc cgc aag cca cct gec aag ttt ttc gtg gee tac 144 
Phe Leu Met Gly Val Arg Lys Pro Pro Ala Lys Phe Phe Val Ala Tyr 



gtg etc tgg tec ttc gca ctg aat ttc tgc tea aca ttt tat cag cca 192 
Val Leu Trp Ser Phe Ala Leu Asn Phe Cys Ser Thr Phe Tyr Gin Pro 



att ggc ttt etc aca ggc tat ata age cat tta tea gag ttc tec ccg 240 



72 



lie Gly Phe Leu Thr Gly Tyr lie Ser His Leu Ser Glu Phe Ser Pro 
65 70 75 80 



gga gag ttt eta act teg ctg cag gtg gec ttt aat get tgg tec tgc 
Gly Glu Phe Leu Thr Ser Leu Gin Val Ala Phe Asn Ala Trp Ser Cys 
85 90 95 

tct aca aaa gtc ctg ata gtg tgg gca eta gtt aag cgc ttt gac gag 
Ser Thr Lys Val Leu lie Val Trp Ala Leu Val Lys Arg Phe Asp Glu 
100 105 110 

get aat aac ctt etc gac gag atg gat agg cgt ate aca gac ccc gga 
Ala Asn Asn Leu Leu Asp Glu Met Asp Arg Arg lie Thr Asp Pro Gly 
115 120 125 

gag cgt ctt cag att cat cgc get gtc tec etc agt aac cgt ata ttc 
Glu Arg Leu Gin He His Arg Ala Val Ser Leu Ser Asn Arg He Phe 
130 135 140 

ttc ttt ttc atg gca gtc tac atg gtt tat gec act aat acg ttt ctg 
Phe Phe Phe Met Ala Val Tyr Met Val Tyr Ala Thr Asn Thr Phe Leu 
145 150 155 160 

teg gcg ate ttc att gga agg cca ccg tac caa aat tac tac cct ttt 
Ser Ala He Phe He Gly Arg Pro Pro Tyr Gin Asn Tyr Tyr Pro Phe 
165 170 175 

ctg gac tgg cga tct age act ctg cat eta get ctg cag gec ggt ctg 
Leu Asp Trp Arg Ser Ser Thr Leu His Leu Ala Leu Gin Ala Gly Leu 
180 185 190 

gaa tac ttc gec atg get ggc gec tgc ttc cag gac gtt tgc gtt gat 
Glu Tyr Phe Ala Met Ala Gly Ala Cys Phe Gin Asp Val Cys Val Asp 
195 200 205 

tgc tac cca gtc aat ttc gtt ttg gtc ctg cgt gee cac atg teg ate 
Cys Tyr Pro Val Asn Phe Val Leu Val Leu Arg Ala His Met Ser He 
210 215 220 

ttc gcg gag cgc ctt cga cgt ttg gga act tat cct tat gaa age cag 
Phe Ala Glu Arg Leu Arg Arg Leu Gly Thr Tyr Pro Tyr Glu Ser Gin 
225 230 235 240 

gag cag aaa tat gaa cga ttg gtt cag tgc ata caa gat cac aaa gta 
Glu Gin Lys Tyr Glu Arg Leu Val Gin Cys He Gin Asp His Lys Val 
245 250 255 

att ttg cga ttt gtt gac tgc ctg cgt cct gtt att tct ggt acc ate 



73 



He Leu Arg Phe Val Asp Cys Leu Arg Pro Val He Ser Gly Thr He 
260 265 270 



ttc gtg caa ttc ttg gtt gtg ggg ttg gtg ctg ggc ttt acc eta att 
Phe Val Gin Phe Leu Val Val Gly Leu Val Leu Gly Phe Thr Leu He 
275 280 285 

aac att gtc ctg ttc gec aac ttg gga teg gec ate gca gcg etc teg 
Asn He Val Leu Phe Ala Asn Leu Gly Ser Ala lie Ala Ala Leu Ser 
290 295 300 

ttt atg gec gca gtg ctt eta gag acg act ccc ttc tgc ata ttg tgc 
Phe Met Ala Ala Val Leu Leu Glu Thr Thr Pro Phe Cys He Leu Cys 
305 310 315 320 

aat tat etc aca gaa gac tgc tac aag ctg gee gat gec ctg ttt cag 
Asn Tyr Leu Thr Glu Asp Cys Tyr Lys Leu Ala Asp Ala Leu Phe Gin 
325 330 335 

tea aac tgg att gat gag gag aaa cga tac caa aag aca etc atg tac 
Ser Asn Trp He Asp Glu Glu Lys Arg Tyr Gin Lys Thr Leu Met Tyr 
340 345 350 

ttc eta cag aaa ctg cag cag cct ata acc ttc atg get atg aac gtg 
Phe Leu Gin Lys Leu Gin Gin Pro He Thr Phe Met Ala Met Asn Val 
355 360 365 

ttt cca ata tct gtg gga act aac ate agt gtc aca aaa ttt teg ttc 
Phe Pro He Ser Val Gly Thr Asn He Ser Val Thr Lys Phe Ser Phe 
370 375 380 

tec gtc ttt act etc gta aaa caa atg aac ata tct gag aaa ctt gee 
Ser Val Phe Thr Leu Val Lys Gin Met Asn He Ser Glu Lys Leu Ala 
385 390 395 400 

aaa tct gaa atg gaa gag 
Lys Ser Glu Met Glu Glu 
405 



<210> 38 
<211> 406 
<212> PRT 

<213> Drosophila melanogaster 
<400> 38 

Met Asp Leu Arg Arg Trp Phe Pro Thr Leu Tyr Thr Gin Ser Lys Asp 



74 



Ser Pro Val Arg Ser Arg Asp Ala Thr Leu Tyr Leu Leu Arg Cys Val 
20 25 30 

Phe Leu Met Gly Val Arg Lys Pro Pro Ala Lys Phe Phe Val Ala Tyr 
35 40 45 

Val Leu Trp Ser Phe Ala Leu Asn Phe Cys Ser Thr Phe Tyr Gin Pro 
50 55 60 

lie Gly Phe Leu Thr Gly Tyr lie Ser His Leu Ser Glu Phe Ser Pro 
65 70 75 80 

Gly Glu Phe Leu Thr Ser Leu Gin Val Ala Phe Asn Ala Trp Ser Cys 
85 90 95 

Ser Thr Lys Val Leu lie Val Trp Ala Leu Val Lys Arg Phe Asp Glu 
100 105 110 

Ala Asn Asn Leu Leu Asp Glu Met Asp Arg Arg He Thr Asp Pro Gly 
115 120 125 

Glu Arg Leu Gin He His Arg Ala Val Ser Leu Ser Asn Arg He Phe 
130 135 140 

Phe Phe Phe Met Ala Val Tyr Met Val Tyr Ala Thr Asn Thr Phe Leu 
145 150 155 160 

Ser Ala He Phe He Gly Arg Pro Pro Tyr Gin Asn Tyr Tyr Pro Phe 
165 170 175 

Leu Asp Trp Arg Ser Ser Thr Leu His Leu Ala Leu Gin Ala Gly Leu 
180 185 190 

Glu Tyr Phe Ala Met Ala Gly Ala Cys Phe Gin Asp Val Cys Val Asp 
195 200 205 

Cys Tyr Pro Val Asn Phe Val Leu Val Leu Arg Ala His Met Ser He 
210 215 220 

Phe Ala Glu Arg Leu Arg Arg Leu Gly Thr Tyr Pro Tyr Glu Ser Gin 
225 230 235 240 

Glu Gin Lys Tyr Glu Arg Leu Val Gin Cys He Gin Asp His Lys Val 
245 250 255 

He Leu Arg Phe Val Asp Cys Leu Arg Pro Val He Ser Gly Thr He 
260 265 270 



75 



Phe Val Gin Phe 
275 

Asn lie Val Leu 
290 

Phe Met Ala Ala 
305 

Asn Tyr Leu Thr 



Ser Asn Trp lie 
340 

Phe Leu Gin Lys 
355 

Phe Pro lie Ser 
370 

Ser Val Phe Thr 
385 

Lys Ser Glu Met 



Leu Val Val Gly 
280 

Phe Ala Asn Leu 
295 

Val Leu Leu Glu 
310 

Glu Asp Cys Tyr 
325 

Asp Glu Glu Lys 



Leu Gin Gin Pro 
360 

Val Gly Thr Asn 
375 

Leu Val Lys Gin 
390 

Glu Glu 
405 



Leu Val Leu Gly 



Gly Ser Ala lie 

300 

Thr Thr Pro Phe 
315 

Lys Leu Ala Asp 
330 

Arg Tyr Gin Lys 
345 

lie Thr Phe Met 



lie Ser Val Thr 
380 

Met Asn lie Ser 
395 



Phe Thr Leu lie 
285 

Ala Ala Leu Ser 



Cys lie Leu Cys 
320 

Ala Leu Phe Gin 
335 

Thr Leu Met Tyr 
350 

Ala Met Asn Val 
365 

Lys Phe Ser Phe 



Glu Lys Leu Ala 
400 



<210> 39 
<211> 1188 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1188) 

<223> DOR 45F.1 

<400> 39 

atg tat ccg cga ttc etc age cgt aac tat ccg ctg gec aag cat ttg 

Met Tyr Pro Arg Phe Leu Ser Arg Asn Tyr Pro Leu Ala Lys His Leu 
15 10 15 

ttc ttc gtc acc aga tac tec ttt ggc ctg ctg ggc ctg aga ttt ggc 
Phe Phe Val Thr Arg Tyr Ser Phe Gly Leu Leu Gly Leu Arg Phe Gly 
20 25 30 



76 



aaa gag caa teg tgg ctt cac etc ttg tgg ctg gtg ttc aat ttc gtt 
Lys Glu Gin Ser Trp Leu His Leu Leu Trp Leu Val Phe Asn Phe Val 
35 40 45 



aac ctg gcg cac tgc tgc cag gcg gag ttc gtc ttc ggc tgg agt cac 
Asn Leu Ala His Cys Cys Gin Ala Glu Phe Val Phe Gly Trp Ser His 
50 55 60 

ttg cgc acc agt ccc gtg gat gec atg gac gec ttt tgt cct ctg gec 
Leu Arg Thr Ser Pro Val Asp Ala Met Asp Ala Phe Cys Pro Leu Ala 
65 70 75 80 

tgc agt ttc acc acg etc ttc aag ctg gga tgg atg tgg tgg cgt cgc 
Cys Ser Phe Thr Thr Leu Phe Lys Leu Gly Trp Met Trp Trp Arg Arg 
85 90 95 

cag gaa gta get gat eta atg gac cgc ate cgc ttg etc ate ggg gag 
Gin Glu Val Ala Asp Leu Met Asp Arg lie Arg Leu Leu lie Gly Glu 
100 105 110 

cag gag aag agg gag gac tec egg aga aag gtg get eaa agg age tac 
Gin Glu Lys Arg Glu Asp Ser Arg Arg Lys Val Ala Gin Arg Ser Tyr 
115 120 125 

tat etc atg gtc acc agg tgc ggt atg ctg gtc ttc acc ctg ggc age 
Tyr Leu Met Val Thr Arg Cys Gly Met Leu Val Phe Thr Leu Gly Ser 
130 135 140 

att acc act gga gee ttc gtt ctg cgt tec ctt tgg gaa atg tgg gtg 
lie Thr Thr Gly Ala Phe Val Leu Arg Ser Leu Trp Glu Met Trp Val 
145 150 155 160 

cgt cgt cat cag gag ttc aaa ttc gat atg ccc ttt cgc atg ctg ttc 
Arg Arg His Gin Glu Phe Lys Phe Asp Met Pro Phe Arg Met Leu Phe 
165 170 175 

cac gac ttt gcg cat cgc atg ccc tgg ttt cca gtt ttc tat etc tac 
His Asp Phe Ala His Arg Met Pro Trp Phe Pro Val Phe Tyr Leu Tyr 
180 185 190 

tec aca tgg agt ggc cag gtc act gtg tac gec ttt get ggt aca gat 
Ser Thr Trp Ser Gly Gin Val Thr Val Tyr Ala Phe Ala Gly Thr Asp 
195 200 205 

ggt ttc ttc ttt ggc ttt acc etc tac atg gee ttc ttg ctg cag gec 
Gly Phe Phe Phe Gly Phe Thr Leu Tyr Met Ala Phe Leu Leu Gin Ala 
210 215 220 



PT i l l ! I B 



tta aga tac gat ate cag gat gec etc aag cca ata aga gat ccc teg 
Leu Arg Tyr Asp lie Gin Asp Ala Leu Lys Pro lie Arg Asp Pro Ser 
225 230 235 240 



720 



ctt agg gaa tec aaa ate tgc tgt cag cga ttg gcg gac ate gtg gat 
Leu Arg Glu Ser Lys lie Cys Cys Gin Arg Leu Ala Asp lie Val Asp 
245 250 255 

cgc cac aat gag ata gag aag ata gtc aag gaa ttt tct gga att atg 
Arg His Asn Glu lie Glu Lys lie Val Lys Glu Phe Ser Gly lie Met 
260 265 270 

get get cca act ttt gtt cac ttc gta tea gee age tta gtg ata gec 
Ala Ala Pro Thr Phe Val His Phe Val Ser Ala Ser Leu Val lie Ala 
275 280 285 

acc age gtc att gat ata eta ttg tat tec ggc tat aac ate ate cgt 
Thr Ser Val lie Asp lie Leu Leu Tyr Ser Gly Tyr Asn lie lie Arg 
290 295 300 

tac gtg gtg tac acc ttc acg gtt tec teg gec ate ttc etc tat tgc 
Tyr Val Val Tyr Thr Phe Thr Val Ser Ser Ala lie Phe Leu Tyr Cys 
305 310 315 320 

tac gga ggc aca gaa atg tea act gag age ctt tec ttg gga gaa gca 
Tyr Gly Gly Thr Glu Met Ser Thr Glu Ser Leu Ser Leu Gly Glu Ala 
325 330 335 

gee tac age agt gec tgg tat act tgg gat cga gag acc cgc agg egg 
Ala Tyr Ser Ser Ala Trp Tyr Thr Trp Asp Arg Glu Thr Arg Arg Arg 
340 345 350 

gtc ttt etc att ate ctg cgt get caa cga ccc att acg gtg agg gtg 
Val Phe Leu lie lie Leu Arg Ala Gin Arg Pro lie Thr Val Arg Val 
355 360 365 

ccc ttt ttt gca cca teg tta cca gtc ttc aca teg gtc ate aag ttt 
Pro Phe Phe Ala Pro Ser Leu Pro Val Phe Thr Ser Val lie Lys Phe 
370 375 380 

aca ggt teg att gtg gca ctg get aag acg ata ctg 
Thr Gly Ser lie Val Ala Leu Ala Lys Thr lie Leu 
385 390 395 



<210> 40 
<211> 396 



78 



<212> PRT 

<213> Drosophila melanogaster 



<400> 40 

Met Tyr Pro Arg Phe Leu Ser Arg Asn Tyr Pro Leu Ala Lys His Leu 



Phe Phe Val Thr Arg Tyr Ser Phe Gly Leu Leu Gly Leu Arg Phe Gly 



Lys Glu Gin Ser Trp Leu His Leu Leu Trp Leu Val Phe Asn Phe Val 
35 40 45 

Asn Leu Ala His Cys Cys Gin Ala Glu Phe Val Phe Gly Trp Ser His 



Leu Arg Thr Ser Pro Val Asp Ala Met Asp Ala Phe Cys Pro Leu Ala 



Cys Ser Phe Thr Thr Leu Phe Lys Leu Gly Trp Met Trp Trp Arg Arg 
85 90 95 

Gin Glu Val Ala Asp Leu Met Asp Arg lie Arg Leu Leu lie Gly Glu 
100 105 110 

Gin Glu Lys Arg Glu Asp Ser Arg Arg Lys Val Ala Gin Arg Ser Tyr 
115 120 125 

Tyr Leu Met Val Thr Arg Cys Gly Met Leu Val Phe Thr Leu Gly Ser 
130 135 140 

lie Thr Thr Gly Ala Phe Val Leu Arg Ser Leu Trp Glu Met Trp Val 
145 150 155 160 

Arg Arg His Gin Glu Phe Lys Phe Asp Met Pro Phe Arg Met Leu Phe 
165 170 175 

His Asp Phe Ala His Arg Met Pro Trp Phe Pro Val Phe Tyr Leu Tyr 
180 185 190 

Ser Thr Trp Ser Gly Gin Val Thr Val Tyr Ala Phe Ala Gly Thr Asp 
195 200 205 

Gly Phe Phe Phe Gly Phe Thr Leu Tyr Met Ala Phe Leu Leu Gin Ala 
210 215 220 

Leu Arg Tyr Asp He Gin Asp Ala Leu Lys Pro He Arg Asp Pro Ser 
225 230 235 240 
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Leu Arg Glu Ser Lys lie Cys Cys Gin Arg Leu Ala Asp lie Val Asp 
245 250 255 



Arg His Asn Glu He Glu Lys He 
260 

Ala Ala Pro Thr Phe Val His Phe 
275 280 

Thr Ser Val He Asp He Leu Leu 
290 295 

Tyr Val Val Tyr Thr Phe Thr Val 
305 310 

Tyr Gly Gly Thr Glu Met Ser Thr 
325 

Ala Tyr Ser Ser Ala Trp Tyr Thr 
340 

Val Phe Leu He He Leu Arg Ala 

355 360 

Pro Phe Phe Ala Pro Ser Leu Pro 
370 375 

Thr Gly Ser He Val Ala Leu Ala 
385 390 



Val Lys Glu Phe Ser Gly He Met 
265 270 

Val Ser Ala Ser Leu Val He Ala 
285 

Tyr Ser Gly Tyr Asn He He Arg 
300 

Ser Ser Ala He Phe Leu Tyr Cys 
315 320 

Glu Ser Leu Ser Leu Gly Glu Ala 
330 335 

Trp Asp Arg Glu Thr Arg Arg Arg 
345 350 

Gin Arg Pro He Thr Val Arg Val 
365 

Val Phe Thr Ser Val He Lys Phe 
380 

Lys Thr He Leu 
395 



<210> 41 
<211> 1158 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1158) 
<223> DOR 49D.1 

<400> 41 

atg ttt gaa gac att cag eta ate tac atg aat ate aag ata ttg cga 
Met Phe Glu Asp He Gin Leu He Tyr Met Asn He Lys He Leu Arg 
15 10 15 



ttc tgg gcc ctg etc tat gac aaa aac ttg agg cgt tat gtg tgc att 
Phe Trp Ala Leu Leu Tyr Asp Lys Asn Leu Arg Arg Tyr Val Cys lie 



gga ctg gcc tea ttc cac ate ttc acc caa ate gtc tac atg atg agt 

Gly Leu Ala Ser Phe His lie Phe Thr Gin lie Val Tyr Met Met Ser 

35 40 45 

acc aat gaa gga eta acc ggg ata att cgt aac tea tat atg etc gtc 

Thr Asn Glu Gly Leu Thr Gly lie He Arg Asn Ser Tyr Met Leu Val 



ctt tgg att aat acg gtg ctg cga get tat etc ttg ctg gcg gat cac 
Leu Trp He Asn Thr Val Leu Arg Ala Tyr Leu Leu Leu Ala Asp His 



gac aga tat ttg get ttg ate caa aaa eta act gag gcc tat tac gat 
Asp Arg Tyr Leu Ala Leu He Gin Lys Leu Thr Glu Ala Tyr Tyr Asp 



tta ctg aat ctg aac gat teg tat ata teg gaa ata ttg gac cag gtg 
Leu Leu Asn Leu Asn Asp Ser Tyr He Ser Glu He Leu Asp Gin Val 
100 105 110 

aac aag gtg gga aag ttg atg get agg ggc aat ctg ttc ttt ggc atg 
Asn Lys Val Gly Lys Leu Met Ala Arg Gly Asn Leu Phe Phe Gly Met 
115 120 125 

etc aca tec atg gga ttc ggt ctg tac cca ttg tec tec age gaa aga 
Leu Thr Ser Met Gly Phe Gly Leu Tyr Pro Leu Ser Ser Ser Glu Arg 
130 135 140 

get ctt aat ttt aaa acc cac ttt cct ttt gca gtc ctg cca ttt ggc 
Ala Leu Asn Phe Lys Thr His Phe Pro Phe Ala Val Leu Pro Phe Gly 
145 150 155 160 

age aaa att cct ggt eta aat gag tac gag agt ccg tac tat gag atg 
Ser Lys He Pro Gly Leu Asn Glu Tyr Glu Ser Pro Tyr Tyr Glu Met 
165 170 175 

tgg tac ate ttt cag atg etc ate acc ccg atg ggc tgt tgc atg tac 
Trp Tyr He Phe Gin Met Leu He Thr Pro Met Gly Cys Cys Met Tyr 
180 185 190 

att ccg tac acc agt ctg att gtg ggc ttg ata atg ttc ggc att gtg 
He Pro Tyr Thr Ser Leu He Val Gly Leu He Met Phe Gly He Val 
195 200 205 



81 



agg tgc aag get ttg cag cat cgc etc cgc cag gtg gcg ctt aag cat 
Arg Cys Lys Ala Leu Gin His Arg Leu Arg Gin Val Ala Leu Lys His 
210 215 220 

ccg tac gga gat cgc gat ccc cgt gaa ctg agg gag gag ate ata gec 
Pro Tyr Gly Asp Arg Asp Pro Arg Glu Leu Arg Glu Glu lie He Ala 
225 230 235 240 

tgc ata cgt tac cag cag age att ate gag tac atg gat cac ata aac 
Cys He Arg Tyr Gin Gin Ser He He Glu Tyr Met Asp His He Asn 
245 250 255 

gag ctg acc acc atg atg ttc eta ttc gaa ctg atg gee ttt teg gcg 
Glu Leu Thr Thr Met Met Phe Leu Phe Glu Leu Met Ala Phe Ser Ala 
260 265 270 

ctg etc tgt gcg ctg etc ttt atg ctg att ate gtc age ggc acc agt 
Leu Leu Cys Ala Leu Leu Phe Met Leu He He Val Ser Gly Thr Ser 
275 280 285 

cag ctg ata att gtt tgc atg tac att aac atg att ctg gec caa ata 
Gin Leu He He Val Cys Met Tyr He Asn Met He Leu Ala Gin He 
290 295 300 

ctg gec etc tat tgg tat gca aat gag tta agg gaa cag aat ctg gcg 
Leu Ala Leu Tyr Trp Tyr Ala Asn Glu Leu Arg Glu Gin Asn Leu Ala 
305 310 315 320 

gtg gee acc gca gee tac gaa acg gag tgg ttc acc ttc gac gtt cca 
Val Ala Thr Ala Ala Tyr Glu Thr Glu Trp Phe Thr Phe Asp Val Pro 
325 330 335 

ctg cgc aaa aac ate ctg ttc atg atg atg agg gca cag egg cca get 
Leu Arg Lys Asn He Leu Phe Met Met Met Arg Ala Gin Arg Pro Ala 
340 345 350 

gca ata eta ctg ggc aat ata cgc ccc ate act ttg gaa ctg ttc caa 
Ala He Leu Leu Gly Asn He Arg Pro He Thr Leu Glu Leu Phe Gin 
355 360 365 

aac eta ctg aac aca acc tat aca ttt ttt acg gtt etc aag cga gtc 
Asn Leu Leu Asn Thr Thr Tyr Thr Phe Phe Thr Val Leu Lys Arg Val 
370 375 380 

tac gga 
Tyr Gly 
385 



<210> 42 
<211> 386 
<212> PRT 

<213> Drosophila melanogaster 
<400> 42 

Met Phe Glu Asp lie Gin Leu lie Tyr Met Asn lie Lys lie Leu Arg 



Phe Trp Ala Leu Leu Tyr Asp Lys Asn Leu Arg Arg Tyr Val Cys lie 
20 25 30 

Gly Leu Ala Ser Phe His lie Phe Thr Gin lie Val Tyr Met Met Ser 



Thr Asn Glu Gly Leu Thr Gly lie lie Arg Asn Ser Tyr Met Leu Val 
50 55 60 

Leu Trp lie Asn Thr Val Leu Arg Ala Tyr Leu Leu Leu Ala Asp His 
65 70 75 80 

Asp Arg Tyr Leu Ala Leu lie Gin Lys Leu Thr Glu Ala Tyr Tyr Asp 



Leu Leu Asn Leu Asn Asp Ser Tyr lie Ser Glu lie Leu Asp Gin Val 
100 105 110 

Asn Lys Val Gly Lys Leu Met Ala Arg Gly Asn Leu Phe Phe Gly Met 
115 120 125 

Leu Thr Ser Met Gly Phe Gly Leu Tyr Pro Leu Ser Ser Ser Glu Arg 
130 135 140 

Ala Leu Asn Phe Lys Thr His Phe Pro Phe Ala Val Leu Pro Phe Gly 
145 150 155 160 

Ser Lys lie Pro Gly Leu Asn Glu Tyr Glu Ser Pro Tyr Tyr Glu Met 
165 170 175 

Trp Tyr lie Phe Gin Met Leu lie Thr Pro Met Gly Cys Cys Met Tyr 
180 185 190 

lie Pro Tyr Thr Ser Leu He Val Gly Leu He Met Phe Gly He Val 
195 200 205 

Arg Cys Lys Ala Leu Gin His Arg Leu Arg Gin Val Ala Leu Lys His 
210 215 220 
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Pro Tyr Gly Asp Arg Asp Pro Arg Glu Leu Arg Glu Glu lie lie Ala 
225 230 235 240 



Cys lie Arg Tyr Gin Gin Ser lie 
245 

Glu Leu Thr Thr Met Met Phe Leu 
260 

Leu Leu Cys Ala Leu Leu Phe Met 
275 280 

Gin Leu lie lie Val Cys Met Tyr 
290 295 

Leu Ala Leu Tyr Trp Tyr Ala Asn 
305 310 

Val Ala Thr Ala Ala Tyr Glu Thr 
325 

Leu Arg Lys Asn lie Leu Phe Met 
340 

Ala lie Leu Leu Gly Asn lie Arg 
355 360 

Asn Leu Leu Asn Thr Thr Tyr Thr 
370 375 



lie Glu Tyr Met Asp His lie Asn 
250 255 

Phe Glu Leu Met Ala Phe Ser Ala 
265 270 

Leu lie lie Val Ser Gly Thr Ser 
285 

lie Asn Met He Leu Ala Gin He 
300 

Glu Leu Arg Glu Gin Asn Leu Ala 
315 320 

Glu Trp Phe Thr Phe Asp Val Pro 
330 335 

Met Met Arg Ala Gin Arg Pro Ala 
345 350 

Pro He Thr Leu Glu Leu Phe Gin 
365 

Phe Phe Thr Val Leu Lys Arg Val 
380 



Tyr Gly 
385 



<210> 43 
<211> 1359 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1359) 

<223> DOR 56E. 1 

<400> 43 

atg gtt aac get aaa cag ttt aac atg ttt aaa gtt aag gat ctg ttg 



Met Val Asn Ala Lys Gin Phe Asn Met Phe Lys Val Lys Asp Leu Leu 
15 10 15 



ctt teg ccg aca act ttc gag gat cca att ttt gga acc cac ctg cga 
Leu Ser Pro Thr Thr Phe Glu Asp Pro lie Phe Gly Thr His Leu Arg 



tac ttc caa tgg tac gga tat gtg gec tec aag gat cag aat agg cct 

Tyr Phe Gin Trp Tyr Gly Tyr Val Ala Ser Lys Asp Gin Asn Arg Pro 
35 40 45 

ttg tta agt ctt ata egg tgc acc att ttg acg gca teg att tgg ctt 

Leu Leu Ser Leu lie Arg Cys Thr lie Leu Thr Ala Ser lie Trp Leu 



age tgt get tta atg ctg gcg aga gtg ttt cgt ggt tac gaa aac etc 
Ser Cys Ala Leu Met Leu Ala Arg Val Phe Arg Gly Tyr Glu Asn Leu 
65 70 75 80 

aat gat ggg gee aca agt tac gee acc gca gtc cag tat ttc gcg gta 
Asn Asp Gly Ala Thr Ser Tyr Ala Thr Ala Val Gin Tyr Phe Ala Val 
85 90 95 

teg att gee atg ttt aat get tac gta caa aga gat aga tat gtt ctt 
Ser lie Ala Met Phe Asn Ala Tyr Val Gin Arg Asp Arg Tyr Val Leu 
100 105 110 

tta tac tta cac att gtt tta gaa gta ata tec ctt ttg cga gtt gee 
Leu Tyr Leu His lie Val Leu Glu Val lie Ser Leu Leu Arg Val Ala 
115 120 125 

cac teg gat ate cag aac ttg atg cac gaa gca gat aat egg gag atg 
His Ser Asp lie Gin Asn Leu Met His Glu Ala Asp Asn Arg Glu Met 
130 135 140 

gaa ctt ttg gtc gec act cag get tat aca cga acc att acc ctg ttg 
Glu Leu Leu Val Ala Thr Gin Ala Tyr Thr Arg Thr lie Thr Leu Leu 
145 150 155 160 

ate tgg ata cca teg gtt att get ggc eta atg gec tat tea gac tgc 
lie Trp lie Pro Ser Val lie Ala Gly Leu Met Ala Tyr Ser Asp Cys 
165 170 175 

ate tac agg agt ctg ttt ctg ccg aaa teg gtt ttc aat gtg cca get 
He Tyr Arg Ser Leu Phe Leu Pro Lys Ser Val Phe Asn Val Pro Ala 
180 185 190 

gtg cga cgt ggt gag gag cat ccc att ctg eta ttt cag ctg ttt ccc 

85 



Val Arg Arg Gly Glu Glu His Pro lie Leu Leu Phe Gin Leu Phe Pro 
195 200 205 



ttc gga gaa ctt tgc gat aac ttc gtt gtt gga tac ttg gga cct tgg 
Phe Gly Glu Leu Cys Asp Asn Phe Val Val Gly Tyr Leu Gly Pro Trp 
210 215 220 

tat get ctg ggc ctg gga ate acg get ate cca ttg tgg cac ace ttt 
Tyr Ala Leu Gly Leu Gly lie Thr Ala lie Pro Leu Trp His Thr Phe 
225 230 235 240 

ate act tgc etc atg aag tac gta aat etc aag ctg caa ata etc aac 
lie Thr Cys Leu Met Lys Tyr Val Asn Leu Lys Leu Gin lie Leu Asn 
245 250 255 

aag cga gtg gag gag atg gat att acc cga ctt aat tec aaa ttg gta 
Lys Arg Val Glu Glu Met Asp lie Thr Arg Leu Asn Ser Lys Leu Val 
260 265 270 

att ggt cgc eta act gec agt gag tta acc ttc tgg caa atg caa etc 
lie Gly Arg Leu Thr Ala Ser Glu Leu Thr Phe Trp Gin Met Gin Leu 
275 280 285 

ttc aag gaa ttt gta aag gaa cag ctg agg att cga aaa ttt gtc cag 
Phe Lys Glu Phe Val Lys Glu Gin Leu Arg lie Arg Lys Phe Val Gin 
290 295 300 

gaa eta cag tat ctg att tgc gtg cct gtg atg gca gat ttc att ate 
Glu Leu Gin Tyr Leu lie Cys Val Pro Val Met Ala Asp Phe He He 
305 310 315 320 

ttc teg gtt etc att tgc ttt etc ttt ttt gec ttg aca gtt ggc gtt 
Phe Ser Val Leu He Cys Phe Leu Phe Phe Ala Leu Thr Val Gly Val 
325 330 335 

cca age aaa atg gat tac ttc ttc atg ttc att tac ctt ttt gtg atg 
Pro Ser Lys Met Asp Tyr Phe Phe Met Phe He Tyr Leu Phe Val Met 
340 345 350 

get ggt ata ttg tgg att tat cat tgg cat gec acg ttg att gtt gaa 
Ala Gly He Leu Trp He Tyr His Trp His Ala Thr Leu He Val Glu 
355 360 365 

tgt cac gat gaa ctg age ctt get tac ttt tct tgc gga tgg tac aac 
Cys His Asp Glu Leu Ser Leu Ala Tyr Phe Ser Cys Gly Trp Tyr Asn 
370 375 380 

ttc gaa atg cct ttg cag aaa atg ctg gtt ttt atg atg atg cat gec 



86 



Phe Glu Met Pro Leu Gin Lys Met Leu Val Phe Met Met Met His Ala 
385 390 395 400 



caa agg ccg atg aag atg cgc gcc ctg ctg gtc gat ttg aat ctg agg 
Gin Arg Pro Met Lys Met Arg Ala Leu Leu Val Asp Leu Asn Leu Arg 
405 410 415 

acc ttc ata gac gta agg ctg eta act get aac teg ata ttg gat tta 
Thr Phe lie Asp Val Arg Leu Leu Thr Ala Asn Ser lie Leu Asp Leu 
420 425 430 

teg aat tea age ctt tec ttt cca gat tgg ccg tgg age eta cag eta 
Ser Asn Ser Ser Leu Ser Phe Pro Asp Trp Pro Trp Ser 'Leu Gin Leu 
435 440 445 

ctt caa ttt get gcg 
Leu Gin Phe Ala Ala 
450 



<210> 44 
<211> 453 
<212> PRT 
<213> Drosophils 

<400> 44 
Met Val Asn Ala 
1 

Leu Ser Pro Thr 
20 

Tyr Phe Gin Trp 
35 

Leu Leu Ser Leu 
50 

Ser Cys Ala Leu 
65 

Asn Asp Gly Ala 



Ser He Ala Met 
100 

Leu Tyr Leu His 



i melanogaster 

Lys Gin Phe Asn 
5 

Thr Phe Glu Asp 

Tyr Gly Tyr Val 
40 

He Arg Cys Thr 
55 

Met Leu Ala Arg 
70 

Thr Ser Tyr Ala 

85 

Phe Asn Ala Tyr 

He Val Leu Glu 
87 



Met Phe Lys Val 
10 

Pro He Phe Gly 
25 

Ala Ser Lys Asp 



He Leu Thr Ala 
60 

Val Phe Arg Gly 
75 

Thr Ala Val Gin 
90 

Val Gin Arg Asp 
105 

Val lie Ser Leu 



Lys Asp Leu Leu 
15 

Thr His Leu Arg 
30 

Gin Asn Arg Pro 
45 

Ser He Trp Leu 



Tyr Glu Asn Leu 

80 

Tyr Phe Ala Val 
95 

Arg Tyr Val Leu 
110 

Leu Arg Val Ala 



' 1 1 1 I' M 



m 



115 



120 



125 



His Ser Asp lie 
130 

Glu Leu Leu Val 
145 

lie Trp lie Pro 



lie Tyr Arg Ser 
180 

Val Arg Arg Gly 
195 

Phe Gly Glu Leu 
210 

Tyr Ala Leu Gly 
225 

lie Thr Cys Leu 



Lys Arg Val Glu 
260 

lie Gly Arg Leu 
275 

Phe Lys Glu Phe 
290 

Glu Leu Gin Tyr 
305 

Phe Ser Val Leu 



Pro Ser Lys Met 
340 

Ala Gly lie Leu 
355 

Cys His Asp Glu 



Gin Asn Leu Met 
135 

Ala Thr Gin Ala 
150 

Ser Val He Ala 
165 

Leu Phe Leu Pro 



Glu Glu His Pro 
200 

Cys Asp Asn Phe 
215 

Leu Gly He Thr 
230 

Met Lys Tyr Val 
245 

Glu Met Asp He 



Thr Ala Ser Glu 
280 

Val Lys Glu Gin 
295 

Leu He Cys Val 
310 

He Cys Phe Leu 
325 

Asp Tyr Phe Phe 



Trp He Tyr His 
360 

Leu Ser Leu Ala 



His Glu Ala Asp 
140 

Tyr Thr Arg Thr 
155 

Gly Leu Met Ala 
170 

Lys Ser Val Phe 
185 

He Leu Leu Phe 



Val Val Gly Tyr 
220 

Ala He Pro Leu 
235 

Asn Leu Lys Leu 
250 

Thr Arg Leu Asn 
265 

Leu Thr Phe Trp 



Leu Arg He Arg 
300 

Pro Val Met Ala 
315 

Phe Phe Ala Leu 
330 

Met Phe He Tyr 
345 

Trp His Ala Thr 



Tyr Phe Ser Cys 



Asn Arg Glu Met 



He Thr Leu Leu 
160 

Tyr Ser Asp Cys 
175 

Asn Val Pro Ala 
190 

Gin Leu Phe Pro 
205 

Leu Gly Pro Trp 



Trp His Thr Phe 

240 

Gin lie Leu Asn 
255 

Ser Lys Leu Val 
270 

Gin Met Gin Leu 
285 

Lys Phe Val Gin 



Asp Phe He He 

. 320 

Thr Val Gly Val 
335 

Leu Phe Val Met 
350 

Leu He Val Glu 
365 

Gly Trp Tyr Asn 



370 

Phe Glu Met Pro 
385 

Gin Arg Pro Met 



Thr Phe lie Asp 
420 

Ser Asn Ser Ser 
435 

Leu Gin Phe Ala 
450 



375 

Leu Gin Lys Met 
390 

Lys Met Arg Ala 
405 

Val Arg Leu Leu 

Leu Ser Phe Pro 
440 

Ala 



380 

Leu Val Phe Met 
395 

Leu Leu Val Asp 
410 

Thr Ala Asn Ser 
425 

Asp Trp Pro Trp 



Met Met His Ala 
400 

Leu Asn Leu Arg 
415 

lie Leu Asp Leu 
430 

Ser Leu Gin Leu 
445 



<210> 45 
<211> 1278 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1278) 

<223> DOR 69F.1 

<400> 45 

atg cag ttg cac gac cat atg aag tac 
Met Gin Leu His Asp His Met Lys Tyr 
1 5 

gca tgc ata cca aga tat caa tgg aaa 
Ala Cys lie Pro Arg Tyr Gin Trp Lys 
20 25 

ttc tac get teg gag caa agg ata gtg 
Phe Tyr Ala Ser Glu Gin Arg lie Val 
35 40 

cag ata ttc cag att act gga gtg ctt 
Gin lie Phe Gin lie Thr Gly Val Leu 
50 55 

cgt ctt gec acg gaa acg ggc acc ttt 
Arg Leu Ala Thr Glu Thr Gly Thr Phe 



ata gac ttg ggt tgc aag atg 48 
lie Asp Leu Gly Cys Lys Met 
10 15 

gga cgc cct act gaa aga cag 96 
Gly Arg Pro Thr Glu Arg Gin 
30 

ttc ctt ctt gga acc att tgc 144 
Phe Leu Leu Gly Thr He Cys 
45 

ate tat tgg tat tgc aat ggc 192 
He Tyr Trp Tyr Cys Asn Gly 
60 

gtg gca caa tta tct gaa atg 240 
Val Ala Gin Leu Ser Glu Met 
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tgc agt tct ttt tgt eta aca ttt gtg gga ttc tgt aac gtt tat gcg 
Cys Ser Ser Phe Cys Leu Thr Phe Val Gly Phe Cys Asn Val Tyr Ala 



ate tct aca aac cgc aat caa att gaa aca tta etc gag gag ctt cat 
lie Ser Thr Asn Arg Asn Gin lie Glu Thr Leu Leu Glu Glu Leu His 
100 105 110 

cag ata tat ccg aga tac agg aaa aat cac tat cgc tgc cag cat tat 
Gin lie Tyr Pro Arg Tyr Arg Lys Asn His Tyr Arg Cys Gin His Tyr 
115 120 125 

ttt gac atg gec atg aca ata atg aga att gag ttt ctt ttc tat atg 
Phe Asp Met Ala Met Thr lie Met Arg lie Glu Phe Leu Phe Tyr Met 
130 135 140 

ate ttg tac gtg tac tac aat agt gca cca tta tgg gtg ctt ctt tgg 
lie Leu Tyr Val Tyr Tyr Asn Ser Ala Pro Leu Trp Val Leu Leu Trp 
145 150 155 160 

gaa cac ttg cac gag gaa tat gat ctt age ttc aag acg cag ace aac 
Glu His Leu His Glu Glu Tyr Asp Leu Ser Phe Lys Thr Gin Thr Asn 
165 170 175 

act tgg ttt cca tgg aaa gtc cat ggg teg gca ctt gga ttt ggt atg 
Thr Trp Phe Pro Trp Lys Val His Gly Ser Ala Leu Gly Phe Gly Met 
180 185 190 

get gta eta age ata ace gtg gga tec ttt gtg ggc gta ggt ttc agt 
Ala Val Leu Ser lie Thr Val Gly Ser Phe Val Gly Val Gly Phe Ser 
195 200 205 

att gtc acc cag aat ctt ate tgt ttg tta acc ttc caa eta aag ttg 
lie Val Thr Gin Asn Leu lie Cys Leu Leu Thr Phe Gin Leu Lys Leu 
210 215 220 

cac tac gat gga ata tec agt cag tta gta tct etc gat tgc cgt cgt 
His Tyr Asp Gly lie Ser Ser Gin Leu Val Ser Leu Asp Cys Arg Arg 
225 230 235 240 

cct gga get cat aag gag ttg age ate etc ate gec cac cac age cga 
Pro Gly Ala His Lys Glu Leu Ser He Leu He Ala His His Ser Arg 
245 250 255 

ate ctt cag ctg ggc gac caa gtc aat gac ata atg aac ttt gta ttc 
He Leu Gin Leu Gly Asp Gin Val Asn Asp He Met Asn Phe Val Phe 



90 



260 



265 



270 



ggc tct age eta gta ggt gec act att gec att tgt atg tea agt gtt 
Gly Ser Ser Leu Val Gly Ala Thr lie Ala He Cys Met Ser Ser Val 
275 280 285 

tct ata atg eta ctg gac tta gca tct gec ttc aaa tat gee agt ggt 
Ser He Met Leu Leu Asp Leu Ala Ser Ala Phe Lys Tyr Ala Ser Gly 
290 295 300 

eta gtg gca ttc gtc etc tac aac ttt gtc ate tgc tac atg gga acc 
Leu Val Ala Phe Val Leu Tyr Asn Phe Val He Cys Tyr Met Gly Thr 
305 310 315 320 

gag gtc act tta get cgt ata aag gtc ggt aat atg ggg caa ata cga 
Glu Val Thr Leu Ala Arg He Lys Val Gly Asn Met Gly Gin He Arg 
325 330 335 

cag cca cgt ttt aga gca gga tgg aat ttg aga act act tta agt att 
Gin Pro Arg Phe Arg Ala Gly Trp Asn Leu Arg Thr Thr Leu Ser He 
340 345 350 

ttg aca gca ttt tgc gtc tgg cga tgt ttc cac gag gaa gat ttg tat 
Leu Thr Ala Phe Cys Val Trp Arg Cys Phe His Glu Glu Asp Leu Tyr 
355 360 365 

cca acg ttt cga agg gca ttc ttt ttg eta ggt aac ttt tgc ctg get 
Pro Thr Phe Arg Arg Ala Phe Phe Leu Leu Gly Asn Phe Cys Leu Ala 
370 375 380 

tac caa tgt att gga gta att ata gat tgt ata gat tgg ttc ata tat 
Tyr Gin Cys lie Gly Val He He Asp Cys He Asp Trp Phe He Tyr 
385 390 395 400 

gga egg aag gcg gtg gat acc caa aga ttc gtt get gag ate tea gag 
Gly Arg Lys Ala Val Asp Thr Gin Arg Phe Val Ala Glu He Ser Glu 
405 410 415 

get aca ggt get cgt cgc agt tgg att ttt 
Ala Thr Gly Ala Arg Arg Ser Trp lie Phe 
420 425 



<210> 46 
<211> 426 
<212> PRT 

<213> Drosophila melanogaster 
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<400> 46 

Met Gin Leu His Asp His Met Lys Tyr lie Asp Leu Gly Cys Lys Met 
15 10 15 



Ala Cys lie Pro Arg Tyr Gin Trp Lys Gly Arg Pro Thr Glu Arg Gin 



Phe Tyr Ala Ser Glu Gin Arg lie Val Phe Leu Leu Gly Thr lie Cys 
35 40 45 

Gin lie Phe Gin lie Thr Gly Val Leu He Tyr Trp Tyr Cys Asn Gly 
50 55 60 

Arg Leu Ala Thr Glu Thr Gly Thr Phe Val Ala Gin Leu Ser Glu Met 
65 70 75 80 

Cys Ser Ser Phe Cys Leu Thr Phe Val Gly Phe Cys Asn Val Tyr Ala 
85 90 95 

He Ser Thr Asn Arg Asn Gin He Glu Thr Leu Leu Glu Glu Leu His 
100 105 110 

Gin He Tyr Pro Arg Tyr Arg Lys Asn His Tyr Arg Cys Gin His Tyr 
115 120 ' 125 

Phe Asp Met Ala Met Thr He Met Arg He Glu Phe Leu Phe Tyr Met 
130 135 140 

He Leu Tyr Val Tyr Tyr Asn Ser Ala Pro Leu Trp Val Leu Leu Trp 
145 150 155 160 

Glu His Leu His Glu Glu Tyr Asp Leu Ser Phe Lys Thr Gin Thr Asn 
165 170 175 

Thr Trp Phe Pro Trp Lys Val His Gly Ser Ala Leu Gly Phe Gly Met 
180 185 190 

Ala Val Leu Ser He Thr Val Gly Ser Phe Val Gly Val Gly Phe Ser 
195 200 205 

He Val Thr Gin Asn Leu He Cys Leu Leu Thr Phe Gin Leu Lys Leu 
210 215 220 

His Tyr Asp Gly He Ser Ser Gin Leu Val Ser Leu Asp Cys Arg Arg 
225 230 235 240 

Pro Gly Ala His Lys Glu Leu Ser He Leu He Ala His His Ser Arg 
245 250 255 
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lie Leu Gin Leu Gly Asp Gin Val Asn Asp lie Met Asn Phe Val Phe 
260 265 270 



Gly Ser Ser Leu Val Gly Ala Thr lie Ala lie Cys Met Ser Ser Val 
275 280 285 

Ser lie Met Leu Leu Asp Leu Ala Ser Ala Phe Lys Tyr Ala Ser Gly 
290 295 300 

Leu Val Ala Phe Val Leu Tyr Asn Phe Val lie Cys Tyr Met Gly Thr 
305 310 315 320 

Glu Val Thr Leu Ala Arg lie Lys Val Gly Asn Met Gly Gin lie Arg 
325 330 335 

Gin Pro Arg Phe Arg Ala Gly Trp Asn Leu Arg Thr Thr Leu Ser lie 
340 345 350 

Leu Thr Ala Phe Cys Val Trp Arg Cys Phe His Glu Glu Asp Leu Tyr 
355 360 365 

Pro Thr Phe Arg Arg Ala Phe Phe Leu Leu Gly Asn Phe Cys Leu Ala 
370 375 380 

Tyr Gin Cys lie Gly Val lie lie Asp Cys lie Asp Trp Phe lie Tyr 
385 390 395 400 

Gly Arg Lys Ala Val Asp Thr Gin Arg Phe Val Ala Glu lie Ser Glu 
405 410 415 

Ala Thr Gly Ala Arg Arg Ser Trp lie Phe 
420 425 



<210> 47 
<211> 1242 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1242) 

<223> DOR 69F.2 

<400> 47 

atg cag ttg gag gac ttt atg egg tac ccg gac etc gtg tgt caa gcg 
93 



Met Gin Leu Glu Asp Phe Met Arg Tyr Pro Asp Leu Val Cys Gin Ala 
15 10 15 



gcc caa ctt ccc aga tac acg tgg aat ggc aga cga tec ttg gaa gtt 
Ala Gin Leu Pro Arg Tyr Thr Trp Asn Gly Arg Arg Ser Leu Glu Val 
20 25 30 

aaa cgc aac ttg gca aaa cgc att ate ttc tgg ctt gga gca gta aat 
Lys Arg Asn Leu Ala Lys Arg lie lie Phe Trp Leu Gly Ala Val Asn 
35 40 45 

ttg gtt tat cac aat att ggc tgc gtc atg tat ggc tat ttc ggt gat 
Leu Val Tyr His Asn lie Gly Cys Val Met Tyr Gly Tyr Phe Gly Asp 
50 55 60 

gga aga aca aag gat cca att gcg tat tta get gaa ttg gca tct gtg 
Gly Arg Thr Lys Asp Pro lie Ala Tyr Leu Ala Glu Leu Ala Ser Val 
65 70 75 80 

gcc age atg ctt ggt ttc ace att gtg ggc acc etc aac ttg tgg aag 
Ala Ser Met Leu Gly Phe Thr lie Val Gly Thr Leu Asn Leu Trp Lys 



atg ctg age ctt aag acc cat ttt gag aac eta eta aat gaa ttc gag 
Met Leu Ser Leu Lys Thr His Phe Glu Asn Leu Leu Asn Glu Phe Glu 
100 105 110 

gaa tta ttt caa eta ate aag cac agg gcg tat cgc ata cac cac tat 
Glu Leu Phe Gin Leu lie Lys His Arg Ala Tyr Arg lie His His Tyr 
115 120 125 

caa gaa aag tat acg cgt cat ata cga aat aca ttt att ttc cat acc 
Gin Glu Lys Tyr Thr Arg His He Arg Asn Thr Phe He Phe His Thr 
130 135 140 

tct gcc gtt gtc tac tac aac tea eta cca att ctt eta atg att egg 
Ser Ala Val Val Tyr Tyr Asn Ser Leu Pro He Leu Leu Met He Arg 
145 150 155 160 

gaa cat ttc teg aac tea cag cag ttg ggc tat aga att cag agt aat 
Glu His Phe Ser Asn Ser Gin Gin Leu Gly Tyr Arg He Gin Ser Asn 
165 170 175 

acc tgg tat ccc tgg cag gtt cag gga tea att cct gga ttt ttt get 
Thr Trp Tyr Pro Trp Gin Val Gin Gly Ser He Pro Gly Phe Phe Ala 
180 185 190 

gca gtc gcc tgt caa ate ttt teg tgc caa acc aat atg tgc gtc aat 
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Ala Val Ala Cys Gin lie Phe Ser Cys Gin Thr Asn Met Cys Val Asn 
195 200 205 



atg ttt ate cag ttt ctg ate aac ttt ttt ggt ate cag eta gaa ata 
Met Phe lie Gin Phe Leu lie Asn Phe Phe Gly lie Gin Leu Glu lie 
210 215 220 

cac ttc gat ggt ttg gec agg cag ctg gag acc ate gat gec cgc aat 
His Phe Asp Gly Leu Ala Arg Gin Leu Glu Thr lie Asp Ala Arg Asn 
225 230 235 240 

ccc cat gec aag gat caa ttg aag tat ctg att gta tat cac aca aaa 
Pro His Ala Lys Asp Gin Leu Lys Tyr Leu He Val Tyr His Thr Lys 
245 250 255 

ttg ctt aat eta gee gac aga gtt aat cga teg ttt aac ttt acg ttt 
Leu Leu Asn Leu Ala Asp Arg Val Asn Arg Ser Phe Asn Phe Thr Phe 
260 265 270 

etc ata agt ctg teg gta tec atg ata tec aac tgt ttt ctg gca ttt 
Leu He Ser Leu Ser Val Ser Met He Ser Asn Cys Phe Leu Ala Phe 
275 280 285 

tec atg acc atg ttc gac ttt ggc acc tct eta aaa cat tta etc gga 
Ser Met Thr Met Phe Asp Phe Gly Thr Ser Leu Lys His Leu Leu Gly 
290 295 300 

ctt ttg eta ttc ate aca tat aat ttt tea atg tgc cgc agt ggt acg 
Leu Leu Leu Phe He Thr Tyr Asn Phe Ser Met Cys Arg Ser Gly Thr 
305 310 315 320 

cac ttg att tta acg agt ggc aaa gta ttg cca gcg gec ttt tat aac 
His Leu He Leu Thr Ser Gly Lys Val Leu Pro Ala Ala Phe Tyr Asn 
325 330 335 

aat tgg tat gaa ggc gat ctt gtt tat cga agg atg etc etc ate ctg 
Asn Trp Tyr Glu Gly Asp Leu Val Tyr Arg Arg Met Leu Leu He Leu 
340 345 350 

atg atg cgt get acg aaa cct tat atg tgg aaa acc tac aag ctg gca 
Met Met Arg Ala Thr Lys Pro Tyr Met Trp Lys Thr Tyr Lys Leu Ala 
355 360 365 

cct gta tec ata act aca tat atg gca gtg agt ttt tec tta ctt aca 
Pro Val Ser He Thr Thr Tyr Met Ala Val Ser Phe Ser Leu Leu Thr 
370 375 380 

tgg cat tta tta ttc aat ttt aat tea tgt gtt ggc ttt cag aca ttg 
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Trp His Leu Leu Phe Asn Phe Asn Ser Cys Val Gly Phe Gin Thr Leu 
385 390 395 400 

aag ttt tea tat caa atg ttt acc tgt gtg egg tec ctt aaa 1242 
Lys Phe Ser Tyr Gin Met Phe Thr Cys Val Arg Ser Leu Lys 



<210> 48 
<211> 414 
<212> PRT 

<213> Drosophila melanogaster 
<400> 48 

Met Gin Leu Glu Asp Phe Met Arg Tyr Pro Asp Leu Val Cys Gin Ala 



Ala Gin Leu Pro Arg Tyr Thr Trp Asn Gly Arg Arg Ser Leu Glu Val 

20 25 30 

Lys Arg Asn Leu Ala Lys Arg lie He Phe Trp Leu Gly Ala Val Asn 

35 40 45 

Leu Val Tyr His Asn He Gly Cys Val Met Tyr Gly Tyr Phe Gly Asp 
50 55 60 

Gly Arg Thr Lys Asp Pro He Ala Tyr Leu Ala Glu Leu Ala Ser Val 



Ala Ser Met Leu Gly Phe Thr He Val Gly Thr Leu Asn Leu Trp Lys 

85 90 95 

Met Leu Ser Leu Lys Thr His Phe Glu Asn Leu Leu Asn Glu Phe Glu 

100 105 110 

Glu Leu Phe Gin Leu He Lys His Arg Ala Tyr Arg He His His Tyr 

115 120 125 

Gin Glu Lys Tyr Thr Arg His He Arg Asn Thr Phe He Phe His Thr 

130 135 140 

Ser Ala Val Val Tyr Tyr Asn Ser Leu Pro He Leu Leu Met He Arg 

145 150 155 160 

Glu His Phe Ser Asn Ser Gin Gin Leu Gly Tyr Arg He Gin Ser Asn 

165 170 175 

Thr Trp Tyr Pro Trp Gin Val Gin Gly Ser He Pro Gly Phe Phe Ala 



m 



180 



185 



190 



Ala Val Ala Cys Gin lie Phe Ser Cys Gin Thr Asn Met Cys Val Asn 
195 200 205 

Met Phe lie Gin Phe Leu lie Asn Phe Phe Gly lie Gin Leu Glu lie 
210 215 220 

His Phe Asp Gly Leu Ala Arg Gin Leu Glu Thr lie Asp Ala Arg Asn 
225 230 235 240 

Pro His Ala Lys Asp Gin Leu Lys Tyr Leu lie Val Tyr His Thr Lys 
245 250 255 

Leu Leu Asn Leu Ala Asp Arg Val Asn Arg Ser Phe Asn Phe Thr Phe 
260 265 270 

Leu He Ser Leu Ser Val Ser Met He Ser Asn Cys Phe Leu Ala Phe 
275 280 285 

Ser Met Thr Met Phe Asp Phe Gly Thr Ser Leu Lys His Leu Leu Gly 
290 295 300 

Leu Leu Leu Phe He Thr Tyr Asn Phe Ser Met Cys Arg Ser Gly Thr 
305 310 315 320 

His Leu He Leu Thr Ser Gly Lys Val Leu Pro Ala Ala Phe Tyr Asn 
325 330 335 

Asn Trp Tyr Glu Gly Asp Leu Val Tyr Arg Arg Met Leu Leu He Leu 
340 345 350 

Met Met Arg Ala Thr Lys Pro Tyr Met Trp Lys Thr Tyr Lys Leu Ala 
355 360 365 

Pro Val Ser He Thr Thr Tyr Met Ala Val Ser Phe Ser Leu Leu Thr 
370 375 380 

Trp His Leu Leu Phe Asn Phe Asn Ser Cys Val Gly Phe Gin Thr Leu 
385 390 395 400 

Lys Phe Ser Tyr Gin Met Phe Thr Cys Val Arg Ser Leu Lys 
405 410 



<210> 49 
<211> 1170 
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<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1170) 

<223> DOR 85A. 1 



<400> 49 

atg gaa gag eta atg aag tac get age ttc ttt aca cag cag tgg gca 48 
Met Glu Glu Leu Met Lys Tyr Ala Ser Phe Phe Thr Gin Gin Trp Ala 
15 10 15 

tac ggg cat ata cca atg ggt gaa gaa tec aaa agg aac aaa ctt ata 96 
Tyr Gly His lie Pro Met Gly Glu Glu Ser Lys Arg Asn Lys Leu He 
20 25 30 

ttt cac ata gtt ttt tgg tec aat gtg att aac ctg age ttc gtt gga 144 
Phe His He Val Phe Trp Ser Asn Val He Asn Leu Ser Phe Val Gly 
35 40 45 

tta ttt gag age att tac gtt tac agt gec ttc atg gat aat aag ttc 192 
Leu Phe Glu Ser He Tyr Val Tyr Ser Ala Phe Met Asp Asn Lys Phe 
50 55 60 



ctg gaa gca gtc act gcg ttg tec tac att ggc ttc gta acc gta ggc 
Leu Glu Ala Val Thr Ala Leu Ser Tyr He Gly Phe Val Thr Val Gly 
65 70 75 . 80 

atg age aag atg ttc ttc ate egg tgg aag aaa acg get ata act gaa 
Met Ser Lys Met Phe Phe He Arg Trp Lys Lys Thr Ala He Thr Glu 
85 90 95 

ctg att aat gaa ttg aag gag ate tat ccg aat ggt ttg ate cga gag 
Leu He Asn Glu Leu Lys Glu He Tyr Pro Asn Gly Leu He Arg Glu 
100 105 110 

gaa aga tac aat ctg ccg atg tat ctg ggc acc tgc tec aga ate age 
Glu Arg Tyr Asn Leu Pro Met Tyr Leu Gly Thr Cys Ser Arg He Ser 
115 120 125 

ctt ata tat tec ttg etc tac tct gtt etc ate tgg aca ttc aac ttg 
Leu He Tyr Ser Leu Leu Tyr Ser Val Leu He Trp Thr Phe Asn Leu 
130 135 140 



ttt tgt gta atg gag tat tgg gtc tat gac aag tgg etc aac att cga 
Phe Cys Val Met Glu Tyr Trp Val Tyr Asp Lys Trp Leu Asn He Arg 
145 150 155 160 



gtg gtg ggc aaa cag ttg ccg tac etc atg tac att cct tgg aaa tgg 
Val Val Gly Lys Gin Leu Pro Tyr Leu Met Tyr lie Pro Trp Lys Trp 
165 170 175 

cag gat aac tgg teg tac tat cca ctg tta ttc tec cag aat ttt gca 
Gin Asp Asn Trp Ser Tyr Tyr Pro Leu Leu Phe Ser Gin Asn Phe Ala 
180 185 190 

gga tac aca tct gca get ggt caa att tea acc gat gtc ttg etc tgc 
Gly Tyr Thr Ser Ala Ala Gly Gin lie Ser Thr Asp Val Leu Leu Cys 
195 200 205 

gcg gtg gee act cag ttg gta atg cac ttc gac ttt etc tea aat agt 
Ala Val Ala Thr Gin Leu Val Met His Phe Asp Phe Leu Ser Asn Ser 
210 215 220 

atg gaa cgc cac gaa ttg agt gga gat tgg aag aag gac tec cga ttt 
Met Glu Arg His Glu Leu Ser Gly Asp Trp Lys Lys Asp Ser Arg Phe 
225 230 235 240 

ctg gtg gac att gtt agg tat cac gaa cgt ata etc cgc ctt tea gat 
Leu Val Asp lie Val Arg Tyr His Glu Arg lie Leu Arg Leu Ser Asp 
245 250 255 

gca gtg aac gat ata ttt gga att cca eta eta etc aac ttc atg gta 
Ala Val Asn Asp lie Phe Gly lie Pro Leu Leu Leu Asn Phe Met Val 
260 265 270 

tec teg ttc gtc ate tgc ttc gtg gga ttc cag atg act gtt gga gtt 
Ser Ser Phe Val He Cys Phe Val Gly Phe Gin Met Thr Val Gly Val 
275 280 285 

ccg ccg gat ata gtt gtg aag etc ttc etc ttc ctt gtc tct teg atg 
Pro Pro Asp He Val Val Lys Leu Phe Leu Phe Leu Val Ser Ser Met 
290 295 300 

agt cag gtc tat ttg att tgt cac tat ggt caa ctg gtg gee gat get 
Ser Gin Val Tyr Leu He Cys His Tyr Gly Gin Leu Val Ala Asp Ala 
305 310 315 320 

age tac gga ttt teg gtt gee acc tac aat cag aag tgg tat aaa gee 
Ser Tyr Gly Phe Ser Val Ala Thr Tyr Asn Gin Lys Trp Tyr Lys Ala 
325 330 335 

gat gtg cgc tat aaa cga gee ttg gtt att att ata get aga teg cag 
Asp Val Arg Tyr Lys Arg Ala Leu Val He He He Ala Arg Ser Gin 
340 345 350 
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aag gta act ttt 
Lys Val Thr Phe 
355 



eta aag gec 
Leu Lys Ala 



act ata ttc ttg gat att acc agg tec 1104 
Thr lie Phe Leu Asp lie Thr Arg Ser 
360 365 



act atg aca gat ctg ctt caa ata tea tac aaa ttc ttc gec ctg ctg 1152 
Thr Met Thr Asp Leu Leu Gin lie Ser Tyr Lys Phe Phe Ala Leu Leu 
370 375 380 

cgc aca atg tat acc caa 1170 
Arg Thr Met Tyr Thr Gin 
385 390 



<210> 50 
<211> 390 
<212> PRT 

<213> Drosophila melanogaster 
<400> 50 

Met Glu Glu Leu Met Lys Tyr Ala Ser Phe Phe Thr Gin Gin Trp Ala 
15 10 15 

Tyr Gly His lie Pro Met Gly Glu Glu Ser Lys Arg Asn Lys Leu lie 
20 25 30 

Phe His lie Val Phe Trp Ser Asn Val lie Asn Leu Ser Phe Val Gly 
35 40 45 

Leu Phe Glu Ser lie Tyr Val Tyr Ser Ala Phe Met Asp Asn Lys Phe 
50 55 60 

Leu Glu Ala Val Thr Ala Leu Ser Tyr lie Gly Phe Val Thr Val Gly 
65 70 75 80 

Met Ser Lys Met Phe Phe lie Arg Trp Lys Lys Thr Ala lie Thr Glu 
85 90 95 

Leu lie Asn Glu Leu Lys Glu lie Tyr Pro Asn Gly Leu lie Arg Glu 
100 105 110 

Glu Arg Tyr Asn Leu Pro Met Tyr Leu Gly Thr Cys Ser Arg lie Ser 
115 120 125 

Leu lie Tyr Ser Leu Leu Tyr Ser Val Leu lie Trp Thr Phe Asn Leu 
130 135 140 

Phe Cys Val Met Glu Tyr Trp Val Tyr Asp Lys Trp Leu Asn lie Arg 

100 



145 



150 



155 



160 



Val Val Gly Lys Gin 
165 

Gin Asp Asn Trp Ser 

180 

Gly Tyr Thr Ser Ala 
195 

Ala Val Ala Thr Gin 
210 

Met Glu Arg His Glu 
225 

Leu Val Asp lie Val 
245 

Ala Val Asn Asp lie 
260 

Ser Ser Phe Val He 
275 

Pro Pro Asp He Val 
290 

Ser Gin Val Tyr Leu 
305 

Ser Tyr Gly Phe Ser 
325 

Asp Val Arg Tyr Lys 
340 

Lys Val Thr Phe Leu 
355 

Thr Met Thr Asp Leu 
370 

Arg Thr Met Tyr Thr 
385 



Leu Pro Tyr Leu Met Tyr 

170 

Tyr Tyr Pro Leu Leu Phe 
185 

Ala Gly Gin He Ser Thr 
200 

Leu Val Met His Phe Asp 
215 

Leu Ser Gly Asp Trp Lys 
230 235 

Arg Tyr His Glu Arg He 
250 

Phe Gly lie Pro Leu Leu 
265 

Cys Phe Val Gly Phe Gin 
280 

Val Lys Leu Phe Leu Phe 
295 

He Cys His Tyr Gly Gin 
310 315 

Val Ala Thr Tyr Asn Gin 

330 

Arg Ala Leu Val He He 
345 

Lys Ala Thr He Phe Leu 
360 

Leu Gin He Ser Tyr Lys 
375 

Gin 
390 



He Pro Trp Lys Trp 
175 

Ser Gin Asn Phe Ala 
190 

Asp Val Leu Leu Cys 
205 

Phe Leu Ser Asn Ser 
220 

Lys Asp Ser Arg Phe 
240 

Leu Arg Leu Ser Asp 
255 

Leu Asn Phe Met Val 
270 

Met Thr Val Gly Val 
285 

Leu Val Ser Ser Met 
300 

Leu Val Ala Asp Ala 
320 

Lys Trp Tyr Lys Ala 
335 

He Ala Arg Ser Gin 
350 

Asp He Thr Arg Ser 
365 

Phe Phe Ala Leu Leu 
380 



101 



<210> 51 
<211> 1167 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1167) 
<223> DOR 85A.3 

<400> 51 

atg aag ttc atg aag tac gca gtt ttc ttt tac aca teg gtg ggc att 48 
Met Lys Phe Met Lys Tyr Ala Val Phe Phe Tyr Thr Ser Val Gly lie 
15 10 15 

gag ccg tat acg att gac teg egg tec aaa aaa gcg age eta tgg tea 96 
Glu Pro Tyr Thr lie Asp Ser Arg Ser Lys Lys Ala Ser Leu Trp Ser 
20 25 30 

cat ctt etc ttc tgg gec aat gtg ate aat tta agt gtc att gtt ttc 144 
His Leu Leu Phe Trp Ala Asn Val lie Asn Leu Ser Val lie Val Phe 
35 40 45 

gga gag ate etc tat ctg gga gtg gec tat tec gat gga aag ttc att 192 
Gly Glu lie Leu Tyr Leu Gly Val Ala Tyr Ser Asp Gly Lys Phe lie 
50 55 60 

gat gec gtc act gta ctg tea tat ate gga ttc gta ate gtg ggc atg 240 
Asp Ala Val Thr Val Leu Ser Tyr lie Gly Phe Val lie Val Gly Met 
65 70 75 80 

age aag atg ttc ttc ata tgg tgg aag aag acc gat eta age gat ttg 288 
Ser Lys Met Phe Phe lie Trp Trp Lys Lys Thr Asp Leu Ser Asp Leu 
85 90 95 

gtt aag gaa ttg gag cac ate tat cca aat ggc aaa get gag gag gag 336 
Val Lys Glu Leu Glu His lie Tyr Pro Asn Gly Lys Ala Glu Glu Glu 
100 105 110 

atg tat egg ttg gat agg tat ctg cga tct tgt tea cga att age att 384 
Met Tyr Arg Leu Asp Arg Tyr Leu Arg Ser Cys Ser Arg lie Ser lie 
115 120 125 

acc tat gca eta etc tac tec gta etc ate tgg acc ttc aat ctg ttc 432 
Thr Tyr Ala Leu Leu Tyr Ser Val Leu lie Trp Thr Phe Asn Leu Phe 
130 135 140 
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agt ate atg caa ttc ctt gtc tat gaa aag ttg ctt aaa ate cga gtg 480 
Ser lie Met Gin Phe Leu Val Tyr Glu Lys Leu Leu Lys He Arg Val 
145 150 155 160 



gtc ggc caa acg ctg cca tat ttg atg tac ttt ccc tgg aac tgg cat 528 
Val Gly Gin Thr Leu Pro Tyr Leu Met Tyr Phe Pro Trp Asn Trp His 
165 170 175 



gaa aac tgg acg tat tat gtg ctg ctg ttc tgt caa aac ttc gca gga 576 
Glu Asn Trp Thr Tyr Tyr Val Leu Leu Phe Cys Gin Asn Phe Ala Gly 
180 185 190 



cat act teg gca teg gga cag ate tct acg gat ctt ttg ctt tgt get 624 
His Thr Ser Ala Ser Gly Gin He Ser Thr Asp Leu Leu Leu Cys Ala 
195 200 205 



gtt get ace cag gtg gta atg cac ttc gat tac ttg gee aga gtg gtg 672 
Val Ala Thr Gin Val Val Met His Phe Asp Tyr Leu Ala Arg Val Val 
210 215 220 



gaa aaa caa gtg tta gat cgc gat tgg age gaa aac tec aga ttt ttg 720 
Glu Lys Gin Val Leu Asp Arg Asp Trp Ser Glu Asn Ser Arg Phe Leu 
225 230 235 240 



gca aaa act gta caa tat cat cag cgc att ctt egg eta atg gac gtt 7 68 
Ala Lys Thr Val Gin Tyr His Gin Arg He Leu Arg Leu Met Asp Val 
245 250 255 



etc aac gat ata ttc ggg ata ccg eta ctg ctt aac ttt atg gtc tec 816 
Leu Asn Asp He Phe Gly He Pro Leu Leu Leu Asn Phe Met Val Ser 
260 265 270 



aca ttt gtc ate tgc ttt gtg gga ttc caa atg ace gtg ggt gtc ccg 8 64 
Thr Phe Val He Cys Phe Val Gly Phe Gin Met Thr Val Gly Val Pro 
275 280 285 



ccg gac ate atg att aag etc ttc ttg ttc ctg ttc teg tec ttg teg 912 
Pro Asp He Met lie Lys Leu Phe Leu Phe Leu Phe Ser Ser Leu Ser 
290 295 300 



caa gtg tac ttg ata tgc cac tac 
Gin Val Tyr Leu He Cys His Tyr 
305 310 

tct age tta teg att tct gca tat 
Ser Ser Leu Ser He Ser Ala Tyr 
325 



ggc cag ctg att gec gat gcg age 960 
Gly Gin Leu He Ala Asp Ala Ser 
315 320 

aag cag aat tgg caa aat get gac 1008 
Lys Gin Asn Trp Gin Asn Ala Asp 
330 335 
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att cgc tat cgt egg get 
lie Arg Tyr Arg Arg Ala 
340 

aca act tat eta aaa get 
Thr Thr Tyr Leu Lys Ala 
355 

atg acg gac ctt ctt caa 
Met Thr Asp Leu Leu Gin 
370 

ace atg tac ata aag 
Thr Met Tyr lie Lys 

385 



ctg gta ttc ttt ata get 
Leu Val Phe Phe lie Ala 
345 

aca att ttc atg aat ata 
Thr lie Phe Met Asn lie 
360 

gta tec tac aaa ttt ttc 
Val Ser Tyr Lys Phe Phe 
375 380 



cga cct cag agg 1056 
Arg Pro Gin Arg 
350 

aca agg gee ace 1104 

Thr Arg Ala Thr 

365 

get ctg ctt cgt 1152 
Ala Leu Leu Arg 
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<210> 52 
<211> 389 
<212> PRT 

<213> Drosophila melanogaster 
<400> 52 

Met Lys Phe Met Lys Tyr Ala Val Phe Phe Tyr Thr Ser Val Gly lie 
15 10 15 

Glu Pro Tyr Thr lie Asp Ser Arg Ser Lys Lys Ala Ser Leu Trp Ser 
20 25 30 

His Leu Leu Phe Trp Ala Asn Val lie Asn Leu Ser Val lie Val Phe 
35 40 45 

Gly Glu lie Leu Tyr Leu Gly Val Ala Tyr Ser Asp Gly Lys Phe lie 
50 55 60 

Asp Ala Val Thr Val Leu Ser Tyr lie Gly Phe Val lie Val Gly Met 
65 70 75 80 

Ser Lys Met Phe Phe lie Trp Trp Lys Lys Thr Asp Leu Ser Asp Leu 
85 90 95 

Val Lys Glu Leu Glu His lie Tyr Pro Asn Gly Lys Ala Glu Glu Glu 
100 105 110 

Met Tyr Arg Leu Asp Arg Tyr Leu Arg Ser Cys Ser Arg lie Ser lie 
115 120 125 

Thr Tyr Ala Leu Leu Tyr Ser Val Leu lie Trp Thr Phe Asn Leu Phe 
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I 



130 



135 



140 



Ser lie Met Gin Phe Leu Val Tyr Glu Lys Leu Leu Lys lie Arg Val 
145 150 155 160 

Val Gly Gin Thr Leu Pro Tyr Leu Met Tyr Phe Pro Trp Asn Trp His 
165 170 175 

Glu Asn Trp Thr Tyr Tyr Val Leu Leu Phe Cys Gin Asn Phe Ala Gly 
180 185 190 

His Thr Ser Ala Ser Gly Gin He Ser Thr Asp Leu Leu Leu Cys Ala 
195 200 205 

Val Ala Thr Gin Val Val Met His Phe Asp Tyr Leu Ala Arg Val Val 
210 215 220 

Glu Lys Gin Val Leu Asp Arg Asp Trp Ser Glu Asn Ser Arg Phe Leu 
225 230 235 240 

Ala Lys Thr Val Gin Tyr His Gin Arg He Leu Arg Leu Met Asp Val 
245 250 255 

Leu Asn Asp He Phe Gly He Pro Leu Leu Leu Asn Phe Met Val Ser 
260 265 270 

Thr Phe Val He Cys Phe Val Gly Phe Gin Met Thr Val Gly Val Pro 
275 280 285 

Pro Asp He Met He Lys Leu Phe Leu Phe Leu Phe Ser Ser Leu Ser 
290 295 300 

Gin Val Tyr Leu He Cys His Tyr Gly Gin Leu He Ala Asp Ala Ser 
305 310 315 320 

Ser Ser Leu Ser He Ser Ala Tyr Lys Gin Asn Trp Gin Asn Ala Asp 
325 330 335 

He Arg Tyr Arg Arg Ala Leu Val Phe Phe He Ala Arg Pro Gin Arg 
340 345 350 

Thr Thr Tyr Leu Lys Ala Thr He Phe Met Asn He Thr Arg Ala Thr 
355 360 365 

Met Thr Asp Leu Leu Gin Val Ser Tyr Lys Phe Phe Ala Leu Leu Arg 
370 375 380 

Thr Met Tyr He Lys 
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385 



<210> 53 
<211> 1305 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1305) 
<223> DOR 85B. 1 



<400> 53 

atg gga etc cag ttg gcg aat ggc acg aag cca teg ccg egg tta ccc 
Met Gly Leu Gin Leu Ala Asn Gly Thr Lys Pro Ser Pro Arg Leu Pro 



aaa tgg tgg cca aag egg ctg gaa atg att ggt aaa gtg ctg ccc aaa 
Lys Trp Trp Pro Lys Arg Leu Glu Met lie Gly Lys Val Leu Pro Lys 



gec tat tgt tec atg gtg att ttc acc tec ctg cat ttg ggt gtc ctg 
Ala Tyr Cys Ser Met Val He Phe Thr Ser Leu His Leu Gly Val Leu 



ttc acg aaa acc aca ctg gat gtc ctg ccg acg ggg gag ctg cag gec 
Phe Thr Lys Thr Thr Leu Asp Val Leu Pro Thr Gly Glu Leu Gin Ala 
50 55 60 

ata acg gat gec etc acc atg acc ata ata tac ttt ttc acg ggc tac 
He Thr Asp Ala Leu Thr Met Thr He He Tyr Phe Phe Thr Gly Tyr 
65 70 75 80 

ggc acc ate tac tgg tgc ctg cgc tec egg cgc etc ttg gec tac atg 
Gly Thr He Tyr Trp Cys Leu Arg Ser Arg Arg Leu Leu Ala Tyr Met 
85 90 95 

gag cac atg aac egg gag tat cgc cat cat teg ctg gec ggg gtg acc 
Glu His Met Asn Arg Glu Tyr Arg His His Ser Leu Ala Gly Val Thr 
100 105 110 

ttt gtg agt age cat gcg gec ttt agg atg tec aga aac ttc acg gtg 
Phe Val Ser Ser His Ala Ala Phe Arg Met Ser Arg Asn Phe Thr Val 
115 120 125 

gtg tgg ata atg tec tgc ctg ctg ggc gtg att tec tgg ggc gtt teg 
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Val Trp He Met Ser Cys Leu Leu Gly Val He Ser Trp Gly Val Ser 
130 135 140 



cca ctg atg ctg ggc ate egg atg ctg ccg etc caa tgt tgg tat ccc 
Pro Leu Met Leu Gly He Arg Met Leu Pro Leu Gin Cys Trp Tyr Pro 
145 150 155 160 

ttc gac gec ctg ggt ccc ggc aca tat acg gcg gtc tat get aca caa 
Phe Asp Ala Leu Gly Pro Gly Thr Tyr Thr Ala Val Tyr Ala Thr Gin 
165 170 175 

ctt ttc ggt cag ate atg gtg ggc atg acc ttt gga ttc ggg gga tea 
Leu Phe Gly Gin He Met Val Gly Met Thr Phe Gly Phe Gly Gly Ser 
180 185 190 

ctg ttt gtc acc ctg age ctg eta etc ctg gga caa ttc gat gtg etc 
Leu Phe Val Thr Leu Ser Leu Leu Leu Leu Gly Gin Phe Asp Val Leu 
195 200 205 

tac tgc age ctg aag aac ctg gat gee cat acc aag ttg ctg ggc ggg 
Tyr Cys Ser Leu Lys Asn Leu Asp Ala His Thr Lys Leu Leu Gly Gly 
210 215 220 

gag tct gta aat ggc ctg agt teg ctg caa gag gag ttg ctg ctg ggg 
Glu Ser Val Asn Gly Leu Ser Ser Leu Gin Glu Glu Leu Leu Leu Gly 
225 230 235 240 

gac teg aag agg gaa tta aat cag tac gtt ttg etc cag gag cat ccg 
Asp Ser Lys Arg Glu Leu Asn Gin Tyr Val Leu Leu Gin Glu His Pro 
245 250 255 

acg gat ctg ctg aga ttg teg gca gga cga aaa tgt cct gac caa gga 
Thr Asp Leu Leu Arg Leu Ser Ala Gly Arg Lys Cys Pro Asp Gin Gly 
260 265 270 

aat gcg ttt cac aac gee ttg gtg gaa tgc att cgc ttg cat cgc ttc 
Asn Ala Phe His Asn Ala Leu Val Glu Cys He Arg Leu His Arg Phe 
275 280 285 

att ctg cac tgc tea cag gag ttg gag aat eta ttc agt cca tat tgt 
He Leu His Cys Ser Gin Glu Leu Glu Asn Leu Phe Ser Pro Tyr Cys 
290 295 300 

ctg gtc aag tea ctg cag ate acc ttt cag ctt tgc ctg ctg gtc ttt 
Leu Val Lys Ser Leu Gin lie Thr Phe Gin Leu Cys Leu Leu Val Phe 
305 310 315 320 

gtg ggc gtt teg ggt act cga gag gtc ctg egg att gtc aac cag eta 
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Val Gly val Ser Gly Thr Arg Glu Val Leu Arg He Val Asn Gin Leu 
325 330 335 

cag tac ttg gga ctg acc ate ttc gag etc eta atg ttc acc tat tgt 
Gin Tyr Leu Gly Leu Thr He Phe Glu Leu Leu Met Phe Thr Tyr Cys 
340 345 350 

ggc gaa etc etc agt egg cat agt att cga tct ggc gac gee ttt tgg 
Gly Glu Leu Leu Ser Arg His Ser He Arg Ser Gly Asp Ala Phe Trp 
355 360 365 

agg ggt gcg tgg tgg aag cac gec cat ttc ate cgc cag gac ate etc 
Arg Gly Ala Trp Trp Lys His Ala His Phe He Arg Gin Asp He Leu 
370 375 380 

ate ttt ctg gtc aat agt aga cgt gca gtt cac gtg act gec ggc aag 
He Phe Leu Val Asn Ser Arg Arg Ala Val His Val Thr Ala Gly Lys 
385 390 395 400 

ttt tat gtg atg gat gtg aat cgt eta aga teg gtt ata acg cag gcg 
Phe Tyr Val Met Asp Val Asn Arg Leu Arg Ser Val lie Thr Gin Ala 
405 410 415 

ttc age ttc ttg act ttg ctg caa aag ttg get gec aag aag acg gaa 
Phe Ser Phe Leu Thr Leu Leu Gin Lys Leu Ala Ala Lys Lys Thr Glu 
420 425 430 

teg gag etc 
Ser Glu Leu 
435 



<210> 54 
<211> 435 
<212> PRT 

<213> Drosophila melanogaster 
<400> 54 

Met Gly Leu Gin Leu Ala Asn Gly 
1 5 

Lys Trp Trp Pro Lys Arg Leu Glu 
20 

Ala Tyr Cys Ser Met Val He Phe 

35 40 

Phe Thr Lys Thr Thr Leu Asp Val 



Thr Lys Pro Ser Pro Arg Leu Pro 
10 15 

Met He Gly Lys Val Leu Pro Lys 
25 30 

Thr Ser Leu His Leu Gly Val Leu 
45 

Leu Pro Thr Gly Glu Leu Gin Ala 
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50 



55 



60 



lie Thr Asp Ala 
65 

Gly Thr He Tyr 



Glu His Met Asn 

100 

Phe Val Ser Ser 
115 

Val Trp He Met 
130 

Pro Leu Met Leu 
145 

Phe Asp Ala Leu 



Leu Phe Gly Gin 

180 

Leu Phe Val Thr 
195 

Tyr Cys Ser Leu 
210 

Glu Ser Val Asn 
225 

Asp Ser Lys Arg 



Thr Asp Leu Leu 
260 

Asn Ala Phe His 
275 

He Leu His Cys 
290 

Leu Val Lys Ser 



Leu Thr Met Thr 

70 

Trp Cys Leu Arg 

85 

Arg Glu Tyr Arg 



His Ala Ala Phe 
120 

Ser Cys Leu Leu 
135 

Gly He Arg Met 
150 

Gly Pro Gly Thr 
165 

He Met Val Gly 



Leu Ser Leu Leu 
200 

Lys Asn Leu Asp 
215 

Gly Leu Ser Ser 
230 

Glu Leu Asn Gin 
245 

Arg Leu Ser Ala 



Asn Ala Leu Val 
280 

Ser Gin Glu Leu 
295 

Leu Gin He Thr 



He He Tyr Phe 
75 

Ser Arg Arg Leu 
90 

His His Ser Leu 
105 

Arg Met Ser Arg 



Gly Val He Ser 
140 

Leu Pro Leu Gin 
155 

Tyr Thr Ala Val 
170 

Met Thr Phe Gly 

185 

Leu Leu Gly Gin 



Ala His Thr Lys 
220 

Leu Gin Glu Glu 
235 

Tyr Val Leu Leu 
250 

Gly Arg Lys Cys 
265 

Glu Cys He Arg 



Glu Asn Leu Phe 
300 

Phe Gin Leu Cys 



Phe Thr Gly Tyr 
80 

Leu Ala Tyr Met 
95 

Ala Gly Val Thr 
110 

Asn Phe Thr Val 
125 

Trp Gly Val Ser 



Cys Trp Tyr Pro 
160 

Tyr Ala Thr Gin 
175 

Phe Gly Gly Ser 
190 

Phe Asp Val Leu 
205 

Leu Leu Gly Gly 



Leu Leu Leu Gly 
240 

Gin Glu His Pro 
255 

Pro Asp Gin Gly 
270 

Leu His Arg Phe 
285 

Ser Pro Tyr Cys 



Leu Leu Val Phe 
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305 

Val Gly Val Ser 



Gin Tyr Leu Gly 
340 

Gly Glu Leu Leu 
355 

Arg Gly Ala Trp 
370 

He Phe Leu Val 
385 

Phe Tyr Val Met 



Phe Ser Phe Leu 
420 

Ser Glu Leu 
435 



310 

Gly Thr Arg Glu 
325 

Leu Thr He Phe 



Ser Arg His Ser 
360 

Trp Lys His Ala 
375 

Asn Ser Arg Arg 
390 

Asp Val Asn Arg 
405 

Thr Leu Leu Gin 



315 

Val Leu Arg He 
330 

Glu Leu Leu Met 
345 

He Arg Ser Gly 



His Phe He Arg 
380 

Ala Val His Val 
395 

Leu Arg Ser Val 
410 

Lys Leu Ala Ala 
425 



320 

Val Asn Gin Leu 
335 

Phe Thr Tyr Cys 
350 

Asp Ala Phe Trp 
365 

Gin Asp He Leu 



Thr Ala Gly Lys 
400 

He Thr Gin Ala 
415 

Lys Lys Thr Glu 
430 



<210> 55 
<211> 1203 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1203) 

<400> 55 

atg aag cca acg gaa ate aaa aaa ccc tat cga atg gag gag ttt ctg 
Met Lys Pro Thr Glu He Lys Lys Pro Tyr Arg Met Glu Glu Phe Leu 



cgt ccg cag atg ttc cag gag gtg get cag atg gtg cat ttc cag tgg 96 
Arg Pro Gin Met Phe Gin Glu Val Ala Gin Met Val His Phe Gin Trp 



egg aga aat ccg gtg gac aac age atg gtg aac gca tec atg gtc ccc 
Arg Arg Asn Pro Val Asp Asn Ser Met Val Asn Ala Ser Met Val Pro 



IIIIJIJ I J, .IWU 1 ,1111 1 „ 



35 



40 



45 



ttc tgc ttg teg gcg ttt ctt aat gtc ctg ttt ttc ggc tgc aat ggt 
Phe Cys Leu Ser Ala Phe Leu Asn Val Leu Phe Phe Gly Cys Asn Gly 
50 55 60 

tgg gac ate ata gga cat ttt tgg ctg gga cat cct gec aac cag aat 
Trp Asp He He Gly His Phe Trp Leu Gly His Pro Ala Asn Gin Asn 
65 70 75 80 

ccg ccc gtg ctt age ate acc att tac ttc teg ate agg gga ttg atg 
Pro Pro Val Leu Ser He Thr He Tyr Phe Ser He Arg Gly Leu Met 
85 90 95 

eta tac ctg aaa cga aag gaa ate gtt gag ttt gtt aac gac ttg gat 
Leu Tyr Leu Lys Arg Lys Glu He Val Glu Phe Val Asn Asp Leu Asp 
100 105 HO 

egg gag tgt ccg egg gac ttg gtc age cag ttg gac atg caa atg gat 
Arg Glu Cys Pro Arg Asp Leu Val Ser Gin Leu Asp Met Gin Met Asp 
115 120 125 

gag acg tac cga aac ttt tgg cag cgc tat cgc ttc ate cgt ate tac 
Glu Thr Tyr Arg Asn Phe Trp Gin Arg Tyr Arg Phe He Arg He Tyr 
130 135 140 

tec cat ttg ggt ggt ccg atg ttc tgc gtt gtg cca tta get eta ttc 
Ser His Leu Gly Gly Pro Met Phe Cys Val Val Pro Leu Ala Leu Phe 
145 150 155 160 

etc ctg acc cac gag ggt aaa gat act cct gtt gee cag cac gag cag 
Leu Leu Thr His Glu Gly Lys Asp Thr Pro Val Ala Gin His Glu Gin 
165 170 175 

etc ctt gga gga tgg ctg cca tgc ggt gtg cga aag gac cca aat ttc 
Leu Leu Gly Gly Trp Leu Pro Cys Gly Val Arg Lys Asp Pro Asn Phe 
180 185 190 

tac ctt tta gtc tgg tec ttc gac ctg atg tgc acc act tgc ggc gtc 
Tyr Leu Leu Val Trp Ser Phe Asp Leu Met Cys Thr Thr Cys Gly Val 
195 200 205 

tec ttt ttc gtt acc ttc gac aac eta ttc aat gtg atg cag gga cat 
Ser Phe Phe Val Thr Phe Asp Asn Leu Phe Asn Val Met Gin Gly His 
210 215 220 

ttg gtc atg cat ttg ggc cat ctt get cgc cag ttt teg gee ate gat 
Leu Val Met His Leu Gly His Leu Ala Arg Gin Phe Ser Ala He Asp 
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235 



240 



cct cga cag agt ttg acc gat gag aag cga ttc ttt gtg gat ctt agg 
Pro Arg Gin Ser Leu Thr Asp Glu Lys Arg Phe Phe Val Asp Leu Arg 
245 250 255 

tta tta gtt cag agg cag cag ctt ctt aat gga ttg tgc aga aaa tac 
Leu Leu Val Gin Arg Gin Gin Leu Leu Asn Gly Leu Cys Arg Lys Tyr 
260 265 270 

aac gac ate ttt aaa gtg gec ttc ctg gtg age aat ttt gta ggc gec 
Asn Asp He Phe Lys Val Ala Phe Leu Val Ser Asn Phe Val Gly Ala 
275 280 285 

ggt tec etc tgc ttc tac etc ttt atg etc teg gag aca tea gat gtc 
Gly Ser Leu Cys Phe Tyr Leu Phe Met Leu Ser Glu Thr Ser Asp Val 
290 295 300 

ctt ate ate gec cag tat ata tta ccc act ttg gtc ctg gtg ggc ttc 
Leu He He Ala Gin Tyr He Leu Pro Thr Leu Val Leu Val Gly Phe 
305 310 315 320 

aca ttt gag att tgt eta egg gga acc caa ctg gaa aag gcg teg gag 
Thr Phe Glu He Cys Leu Arg Gly Thr Gin Leu Glu Lys Ala Ser Glu 
325 330 335 

gga ctg gaa teg teg ttg cga age cag gaa tgg tat ttg gga agt agg 
Gly Leu Glu Ser Ser Leu Arg Ser Gin Glu Trp Tyr Leu Gly Ser Arg 
340 345 350 

egg tac egg aag ttc tat ttg etc tgg acg caa tat tgc cag cga aca 
Arg Tyr Arg Lys Phe Tyr Leu Leu Trp Thr Gin Tyr Cys Gin Arg Thr 
355 360 365 

cag caa ctg ggc gec ttt ggg eta ate caa gtc aat atg gtg cac ttc 
Gin Gin Leu Gly Ala Phe Gly Leu He Gin Val Asn Met Val His Phe 
370 375 380 

act gaa ata atg cag ctg gee tat aga etc ttc act ttt etc aaa tct 
Thr Glu He Met Gin Leu Ala Tyr Arg Leu Phe Thr Phe Leu Lys Ser 
385 390 395 400 



cat 
His 



<210> 56 
<211> 401 



112 



<212> PRT 

<213> Drosophila melanogaster 



<400> 56 

Met Lys Pro Thr Glu lie Lys Lys Pro Tyr Arg Met Glu Glu Phe Leu 

1 5 10 15 

Arg Pro Gin Met Phe Gin Glu Val Ala Gin Met Val His Phe Gin Trp 



Arg Arg Asn Pro Val Asp Asn Ser Met Val Asn Ala Ser Met Val Pro 
35 40 45 

Phe Cys Leu Ser Ala Phe Leu Asn Val Leu Phe Phe Gly Cys Asn Gly 



Trp Asp He He Gly His Phe Trp Leu Gly His Pro Ala Asn Gin Asn 
65 70 75 80 

Pro Pro Val Leu Ser He Thr He Tyr Phe Ser lie Arg Gly Leu Met 



Leu Tyr Leu Lys Arg Lys Glu He Val Glu Phe Val Asn Asp Leu Asp 
100 105 110 

Arg Glu Cys Pro Arg Asp Leu Val Ser Gin Leu Asp Met Gin Met Asp 
115 120 125 

Glu Thr Tyr Arg Asn Phe Trp Gin Arg Tyr Arg Phe He Arg He Tyr 
130 135 140 

Ser His Leu Gly Gly Pro Met Phe Cys Val Val Pro Leu Ala Leu Phe 
145 150 155 160 

Leu Leu Thr His Glu Gly Lys Asp Thr Pro Val Ala Gin His Glu Gin 
165 170 175 

Leu Leu Gly Gly Trp Leu Pro Cys Gly Val Arg Lys Asp Pro Asn Phe 
180 185 190 

Tyr Leu Leu Val Trp Ser Phe Asp Leu Met Cys Thr Thr Cys Gly Val 
195 200 205 

Ser Phe Phe Val Thr Phe Asp Asn Leu Phe Asn Val Met Gin Gly His 
210 215 220 

Leu Val Met His Leu Gly His Leu Ala Arg Gin Phe Ser Ala He Asp 
225 230 235 240 
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Pro Arg Gin Ser 



Leu Leu Val Gin 
260 

Asn Asp lie Phe 
275 

Gly Ser Leu Cys 
290 

Leu He He Ala 
305 

Thr Phe Glu He 



Gly Leu Glu Ser 
340 

Arg Tyr Arg Lys 
355 

Gin Gin Leu Gly 
370 

Thr Glu He Met 
385 

His 



Leu Thr Asp Glu 
245 

Arg Gin Gin Leu 



Lys Val Ala Phe 
280 

Phe Tyr Leu Phe 
295 

Gin Tyr He Leu 
310 

Cys Leu Arg Gly 
325 

Ser Leu Arg Ser 



Phe Tyr Leu Leu 
360 

Ala Phe Gly Leu 
375 

Gin Leu Ala Tyr 
390 



Lys Arg Phe Phe 
250 

Leu Asn Gly Leu 
265 

Leu Val Ser Asn 



Met Leu Ser Glu 
300 

Pro Thr Leu Val 
315 

Thr Gin Leu Glu 
330 

Gin Glu Trp Tyr 
345 

Trp Thr Gin Tyr 



He Gin Val Asn 
380 

Arg Leu Phe Thr 
395 



Val Asp Leu Arg 
255 

Cys Arg Lys Tyr 
270 

Phe Val Gly Ala 
285 

Thr Ser Asp Val 



Leu Val Gly Phe 
320 

Lys Ala Ser Glu 
335 

Leu Gly Ser Arg 
350 

Cys Gin Arg Thr 
365 

Met Val His Phe 



Phe Leu Lys Ser 
400 



<210> 57 
<211> 1131 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1131) 
<223> DOR 92E.1 

<400> 57 

atg act ttc tac aag acc ate ggc gag gat ctg tac tec gat agg gat 48 



114 



Met Thr Phe Tyr Lys Thr He Gly Glu Asp Leu Tyr Ser Asp Arg Asp 
1 5 10 15 

ccg aat gtg ata agg cgt tac ctg eta cgt ttt tat ctg gta etc ggt 
Pro Asn Val He Arg Arg Tyr Leu Leu Arg Phe Tyr Leu Val Leu Gly 
20 25 30 

ttt etc aac ttc aat gec tat gtg gtg ggc gaa ate gcg tac ttt ata 
Phe Leu Asn Phe Asn Ala Tyr Val Val Gly Glu He Ala Tyr Phe lie 
35 40 " 45 

gtc cat ata atg teg acg act act ctt ttg gag gee act gca gtg gca 
Val His He Met Ser Thr Thr Thr Leu Leu Glu Ala Thr Ala Val Ala 
50 55 60 

ccg tgc att ggt ttc age ttc atg gec gac ttt aag cag ttc ggt etc 
Pro Cys He Gly Phe Ser Phe Met Ala Asp Phe Lys Gin Phe Gly Leu 
65 70 75 80 

aca gtg aat aga aag cga ttg gtc aga ttg ctg gat gat etc aag gag 
Thr Val Asn Arg Lys Arg Leu Val Arg Leu Leu Asp Asp Leu Lys Glu 
85 90 95 

ata ttt cct tta gat tta gaa gcg cag egg aag tat aac gta teg ttt 
He Phe Pro Leu Asp Leu Glu Ala Gin Arg Lys Tyr Asn Val Ser Phe 
100 105 110 

tac egg aaa cac atg aac agg gtc atg acc eta ttc ace ate etc tgc 
Tyr Arg Lys His Met Asn Arg Val Met Thr Leu Phe Thr He Leu Cys 
115 120 125 

atg acc tac acc teg tea ttt age ttt tat cca gee ate aag teg acc 
Met Thr Tyr Thr Ser Ser Phe Ser Phe Tyr Pro Ala He Lys Ser Thr 
130 135 140 

ata aag tat tac ctt atg gga teg gaa ate ttt gag cgc aac tac gga 
He Lys Tyr Tyr Leu Met Gly Ser Glu He Phe Glu Arg Asn Tyr Gly 
145 150 155 160 

ttt cac att ttg ttt ccc tac gac gca gaa acg gat ctg acg gtc tac 
Phe His He Leu Phe Pro Tyr Asp Ala Glu Thr Asp Leu Thr Val Tyr 
165 170 175 

tgg ttt tec tac tgg gga ttg get cat tgt gee tat gtg gec gga gtt 
Trp Phe Ser Tyr Trp Gly Leu Ala His Cys Ala Tyr Val Ala Gly Val 
180 185 190 

tec tac gtc tgc gtg gat etc ctg ctg ate gcg acc ata acc cag ctg 
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Ser Tyr Val Cys Val Asp Leu Leu Leu He Ala Thr He Thr Gin Leu 
195 200 205 

acc atg cac ttc aac ttt ata gcg aat gat ttg gag gcc tac gaa gga 
Thr Met His Phe Asn Phe He Ala Asn Asp Leu Glu Ala Tyr Glu Gly 
210 215 220 

ggt gat cat acg gat gaa gaa aat ate aaa tac ctg cac aac ttg gtc 
Gly Asp His Thr Asp Glu Glu Asn He Lys Tyr Leu His Asn Leu Val 
225 230 235 240 

gtc tat cat gcc agg gcg ctg gac etc age gag gag gtc aac aac ata 
Val Tyr His Ala Arg Ala Leu Asp Leu Ser Glu Glu Val Asn Asn He 
245 250 255 

ttc age ttc ctg ate ctg tgg aac ttt att gcc gca teg etc gtg att 
Phe Ser Phe Leu He Leu Trp Asn Phe He Ala Ala Ser Leu Val He 
260 265 270 

tgc ttc get ggc ttt cag att aca gcc tea aat gtg gag gac ata ggg 
Cys Phe Ala Gly Phe Gin He Thr Ala Ser Asn Val Glu Asp He Gly 
275 280 285 

gtg tac ttc ata ttt ttt tea get teg ctg gtt caa gtc ttt aaa tgt 
Val Tyr Phe He Phe Phe Ser Ala Ser Leu Val Gin Val Phe Lys Cys 
290 295 300 

tct ttt cag age tct egg att ggc cat teg gca ttt aat cag aac tgg 
Ser Phe Gin Ser Ser Arg He Gly His Ser Ala Phe Asn Gin Asn Trp 
305 310 315 320 

ttg cca tgc age acc aaa tac aaa cgc ate ctg cag ttt att ate gcg 
Leu Pro Cys Ser Thr Lys Tyr Lys Arg He Leu Gin Phe He He Ala 
325 330 335 

cgc age cag aag ccc gcc tct ata aga ccg cct acc ttt cca ccc ata 
Arg Ser Gin Lys Pro Ala Ser He Arg Pro Pro Thr Phe Pro Pro lie 
340 345 350 

tct ttt aat acc ttt atg aag gta ate age atg teg tat cag ttt ttt 
Ser Phe Asn Thr Phe Met Lys Val He Ser Met Ser Tyr Gin Phe Phe 
355 360 ' 365 

gca ctg etc cgc acc aca tat tat ggt 
Ala Leu Leu Arg Thr Thr Tyr Tyr Gly 
370 375 
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<210> 58 
<211> 377 
<212> PRT 

<213> Drosophila raelanogaster 
<400> 58 

Met Thr Phe Tyr Lys Thr He Gly Glu Asp Leu Tyr Ser Asp Arg Asp 
15 10 15 

Pro Asn Val He Arg Arg Tyr Leu Leu Arg Phe Tyr Leu Val Leu Gly 
20 25 30 

Phe Leu Asn Phe Asn Ala Tyr Val Val Gly Glu He Ala Tyr Phe He 



Val His He Met Ser Thr Thr Thr Leu Leu Glu Ala Thr Ala Val Ala 
50 55 60 

Pro Cys He Gly Phe Ser Phe Met Ala Asp Phe Lys Gin Phe Gly Leu 



Thr Val Asn Arg Lys Arg Leu Val Arg Leu Leu Asp Asp Leu Lys Glu 



He Phe Pro Leu Asp Leu Glu Ala Gin Arg Lys Tyr Asn Val Ser Phe 
100 105 110 

Tyr Arg Lys His Met Asn Arg Val Met Thr Leu Phe Thr He Leu Cys 
115 120 125 

Met Thr Tyr Thr Ser Ser Phe Ser Phe Tyr Pro Ala He Lys Ser Thr 
130 135 140 

He Lys Tyr Tyr Leu Met Gly Ser Glu lie Phe Glu Arg Asn Tyr Gly 
145 150 155 160 

Phe His He Leu Phe Pro Tyr Asp Ala Glu Thr Asp Leu Thr Val Tyr 
165 170 175 

Trp Phe Ser Tyr Trp Gly Leu Ala His Cys Ala Tyr Val Ala Gly Val 
180 185 190 

Ser Tyr Val Cys Val Asp Leu Leu Leu lie Ala Thr He Thr Gin Leu 
195 200 205 

Thr Met His Phe Asn Phe He Ala Asn Asp Leu Glu Ala Tyr Glu Gly 
210 215 220 
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Gly Asp His Thr 
225 

Val Tyr His Ala 



Phe Ser Phe Leu 
260 

Cys Phe Ala Gly 
275 

Val Tyr Phe He 
290 

Ser Phe Gin Ser 
305 

Leu Pro Cys Ser 



Arg Ser Gin Lys 
340 

Ser Phe Asn Thr 
355 

Ala Leu Leu Arg 
370 



Asp Glu Glu Asn 
230 

Arg Ala Leu Asp 
245 

He Leu Trp Asn 



Phe Gin He Thr 
280 

Phe Phe Ser Ala 
295 

Ser Arg He Gly 
310 

Thr Lys Tyr Lys 
325 

Pro Ala Ser He 



Phe Met Lys Val 
360 

Thr Thr Tyr Tyr 
375 



He Lys Tyr Leu 
235 

Leu Ser Glu Glu 
250 

Phe He Ala Ala 
265 

Ala Ser Asn Val 



Ser Leu Val Gin 
300 

His Ser Ala Phe 
315 

Arg He Leu Gin 
330 

Arg Pro Pro Thr 
345 

He Ser Met Ser 



Gly 



His Asn Leu Val 
240 

Val Asn Asn He 
255 

Ser Leu Val He 
270 

Glu Asp He Gly 
285 

Val Phe Lys Cys 



Asn Gin Asn Trp 
320 

Phe He lie Ala 
335 

Phe Pro Pro lie 
350 

Tyr Gin Phe Phe 
365 



<210> 59 
<211> 1161 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1161) 

<223> DOR 94D.1 

<400> 59 

atg gat aaa cac aag gat cgc att gaa tec atg cgc eta att ctt cag 48 
Met Asp Lys His Lys Asp Arg lie Glu Ser Met Arg Leu lie Leu Gin 
15 10 15 

gtc atg caa eta ttt ggc etc tgg ccg tgg tec ttg aaa teg gaa gag 96 
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Val Met Gin Leu Phe Gly Leu Trp Pro Trp Ser Leu Lys Ser Glu Glu 
20 25 30 

gag tgg act ttc acc ggt ttt gta aag cgc aac tat cgc ttc ctg etc 
Glu Trp Thr Phe Thr Gly Phe Val Lys Arg Asn Tyr Arg Phe Leu Leu 



cat ctg ccc att acc ttc acc ttt att gga etc atg tgg ctg gag gec 
His Leu Pro lie Thr Phe Thr Phe lie Gly Leu Met Trp Leu Glu Ala 
50 55 60 

ttc ate teg age aat ctg gag cag get ggc cag gtt ctg tac atg tec 
Phe lie Ser Ser Asn Leu Glu Gin Ala Gly Gin Val Leu Tyr Met Ser 
65 70 75 80 

ate acc gag atg get ttg gtg gtg aaa ate ctg age att tgg cac tat 
He Thr Glu Met Ala Leu Val Val Lys He Leu Ser He Trp His Tyr 
85 90 95 

cgc acc gaa get tgg egg ctg atg tac gaa etc caa cat get ccg gac 
Arg Thr Glu Ala Trp Arg Leu Met Tyr Glu Leu Gin His Ala Pro Asp 
100 105 110 

tac caa etc cac aac cag gag gag gta gac ttt tgg cgc egg gag caa 
Tyr Gin Leu His Asn Gin Glu Glu Val Asp Phe Trp Arg Arg Glu Gin 
115 120 125 

cga ttc ttc aag tgg ttc ttc tac ate tac att ctg att age ttg ggc 
Arg Phe Phe Lys Trp Phe Phe Tyr He Tyr He Leu He Ser Leu Gly 
130 135 140 

gtg gta tat agt ggc tgc act gga gta ctt ttt ctg gag ggc tac gaa 
Val Val Tyr Ser Gly Cys Thr Gly Val Leu Phe Leu Glu Gly Tyr Glu 
145 150 155 160 

ctg ccc ttt gec tac tac gtg ccc ttc gaa tgg cag aac gag aga agg 
Leu Pro Phe Ala Tyr Tyr Val Pro Phe Glu Trp Gin Asn Glu Arg Arg 
165 170 175 

tac tgg ttc gec tat ggt tac gat atg gcg ggc atg acg ctg acc tgc 
Tyr Trp Phe Ala Tyr Gly Tyr Asp Met Ala Gly Met Thr Leu Thr Cys 
180 185 190 

ate tea aac att acc ctg gac acc ctg ggt tgc tat ttc ctg ttc cat 
He Ser Asn He Thr Leu Asp Thr Leu Gly Cys Tyr Phe Leu Phe His 
195 200 205 

ate tct ctt ttg tac cga ctg ctt ggt ctg cga ttg agg gaa acg aag 
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He Ser Leu Leu Tyr Arg Leu Leu Gly Leu Arg Leu Arg Glu Thr Lys 
210 215 220 



aat atg aag aat gat acc att ttt ggc cag cag ttg cgt gcc ate ttc 
Asn Met Lys Asn Asp Thr He Phe Gly Gin Gin Leu Arg Ala He Phe 
225 230 235 240 

att atg cat cag agg att aga age eta acc ctg acc tgc cag aga ate 
He Met His Gin Arg He Arg Ser Leu Thr Leu Thr Cys Gin Arg He 
245 250 255 

gta tct ccc tat ate eta tct cag ate att ttg agt gcc ctg ate ate 
Val Ser Pro Tyr He Leu Ser Gin He He Leu Ser Ala Leu He He 
260 265 270 

tgc ttt agt gga tac cgc ttg cag cat gtg gga att cgc gat aat ccc 
Cys Phe Ser Gly Tyr Arg Leu Gin His Val Gly lie Arg Asp Asn Pro 
275 280 285 

ggc cag ttt ata tec atg ttg cag ttt gtc agt gtg atg ate ctg cag 
Gly Gin Phe He Ser Met Leu Gin Phe Val Ser Val Met He Leu Gin 
290 295 300 

att tac ttg ccc tgc tac tat gga aac gag ata acc gtg tat gcc aat 
lie Tyr Leu Pro Cys Tyr Tyr Gly Asn Glu He Thr Val Tyr Ala Asn 
305 310 315 320 

cag ctg acc aac gag gtt tac cat acc aat tgg ctg gaa tgt egg cca 
Gin Leu Thr Asn Glu Val Tyr His Thr Asn Trp Leu Glu Cys Arg Pro 
325 330 335 

ccg att cga aag tta etc aat gcc tac atg gag cac ctg aag aaa ccg 
Pro He Arg Lys Leu Leu Asn Ala Tyr Met Glu His Leu Lys Lys Pro 
340 345 350 

gtg acc ate egg get ggc aac tac ttc gcc gtg gga eta cca att ttt 
Val Thr He Arg Ala Gly Asn Tyr Phe Ala Val Gly Leu Pro He Phe 
355 360 365 

gtt aag acc ate aac aac gcc tac agt ttc ttg get tta tta eta aat 
Val Lys Thr He Asn Asn Ala Tyr Ser Phe Leu Ala Leu Leu Leu Asn 
370 375 380 

gta teg aat 
Val Ser Asn 
385 
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<210> 60 
<211> 387 
<212> PRT 

<213> Drosophila melanogaster 
<400> 60 

Met Asp Lys His Lys Asp Arg lie Glu Ser Met Arg Leu lie Leu Gin 



Val Met Gin Leu Phe Gly Leu Trp Pro Trp Ser Leu Lys Ser Glu Glu 
20 25 30 

Glu Trp Thr Phe Thr Gly Phe Val Lys Arg Asn Tyr Arg Phe Leu Leu 
35 40 45 

His Leu Pro He Thr Phe Thr Phe He Gly Leu Met Trp Leu Glu Ala 
50 55 60 

Phe He Ser Ser Asn Leu Glu Gin Ala Gly Gin Val Leu Tyr Met Ser 
65 70 75 80 

lie Thr Glu Met Ala Leu Val' Val Lys He Leu Ser He Trp His Tyr 



Arg Thr Glu Ala Trp Arg Leu Met Tyr Glu Leu Gin His Ala Pro Asp 
100 105 HO 

Tyr Gin Leu His Asn Gin Glu Glu Val Asp Phe Trp Arg Arg Glu Gin 
115 120 125 

Arg Phe Phe Lys Trp Phe Phe Tyr He Tyr He Leu He Ser Leu Gly 
130 135 140 

Val Val Tyr Ser Gly Cys Thr Gly Val Leu Phe Leu Glu Gly Tyr Glu 
145 150 155 160 

Leu Pro Phe Ala Tyr Tyr Val Pro Phe Glu Trp Gin Asn Glu Arg Arg 
165 170 175 

Tyr Trp Phe Ala Tyr Gly Tyr Asp Met Ala Gly Met Thr Leu Thr Cys 
180 185 190 

He Ser Asn He Thr Leu Asp Thr Leu Gly Cys Tyr Phe Leu Phe His 
195 200 205 

He Ser Leu Leu Tyr Arg Leu Leu Gly Leu Arg Leu Arg Glu Thr Lys 
210 215 220 
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Asn Met Lys Asn 
225 

He Met His Gin 



Val Ser Pro Tyr 
260 

Cys Phe Ser Gly 
275 

Gly Gin Phe He 
290 

He Tyr Leu Pro 
305 

Gin Leu Thr Asn 



Pro He Arg Lys 
340 

Val Thr He Arg 
355 

Val Lys Thr He 
370 

Val Ser Asn 
385 



Asp Thr He Phe 
230 

Arg He Arg Ser 
245 

He Leu Ser Gin 



Tyr Arg Leu Gin 
280 

Ser Met Leu Gin 
295 

Cys Tyr Tyr Gly 
310 

Glu Val Tyr His 
325 

Leu Leu Asn Ala 



Ala Gly Asn Tyr 
360 

Asn Asn Ala Tyr 
375 



Gly Gin Gin Leu 
235 

Leu Thr Leu Thr 

250 

He He Leu Ser 
265 

His Val Gly He 



Phe Val Ser Val 
300 

Asn Glu He Thr 
315 

Thr Asn Trp Leu 
330 

Tyr Met Glu His 
345 

Phe Ala Val Gly 



Ser Phe Leu Ala 
380 



Arg Ala He Phe 

240 

Cys Gin Arg He 
255 

Ala Leu He He 
270 

Arg Asp Asn Pro 
285 

Met He Leu Gin 



Val Tyr Ala Asn 
320 

Glu Cys Arg Pro 
335 

Leu Lys Lys Pro 
350 

Leu Pro He Phe 
365 

Leu Leu Leu Asn 



<210> 61 
<211> 1101 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1101) 

<400> 61 

atg gag tct aca aat cgc eta agt gec ate caa aca ctt tta gta ate 

Met Glu Ser Thr Asn Arg Leu Ser Ala He Gin Thr Leu Leu Val He 

15 10 15 

122 



m 



caa cgt tgg ata gga ctt ctt aaa tgg gaa aac gag ggc gag gat gga 
Gin Arg Trp He Gly Leu Leu Lys Trp Glu Asn Glu Gly Glu Asp Gly 
20 25 30 

gta tta acc tgg eta aaa cga ata tat cct ttt gta ctg cac ctt cca 
Val Leu Thr Trp Leu Lys Arg He Tyr Pro Phe Val Leu His Leu Pro 



ctg acc ttc acg tat att gec tta atg tgg tat gaa get att aca teg 
Leu Thr Phe Thr Tyr He Ala Leu Met Trp Tyr Glu Ala He Thr Ser 



tea gat ttt gag gaa get ggt caa gtt ctg tac atg tec ate acc gaa 
Ser Asp Phe Glu Glu Ala Gly Gin Val Leu Tyr Met Ser He Thr Glu 



ctg gca ttg gtc act aaa ctg ctg aat att tgg tat cgt cgt cat gaa 
Leu Ala Leu Val Thr Lys Leu Leu Asn He Trp Tyr Arg Arg His Glu 
85 90 95 

get get agt eta ate cac gaa ttg caa cac gat ccc gca ttt aat ctg 
Ala Ala Ser Leu He His Glu Leu Gin His Asp Pro Ala Phe Asn Leu 
100 105 HO 

cgc aat teg gag gaa ate aaa ttc tgg gag caa aat cag agg aac ttt 
Arg Asn Ser Glu Glu He Lys Phe Trp Gin Gin Asn Gin Arg Asn Phe 
115 120 125 

aag aga ata ttt tac tgg tac ate tgg ggc age ctt ttc gtg get gta 
Lys Arg He Phe Tyr Trp Tyr He Trp Gly Ser Leu Phe Val Ala Val 
130 135 140 

atg ggt tat ata age gtg ttt ttc cag gag gat tac gag ctg ccc ttt 
Met Gly Tyr He Ser Val Phe Phe Gin Glu Asp Tyr Glu Leu Pro Phe 
145 150 155 160 

ggc tac tac gtg cca ttc gag tgg cgc acc agg gaa cga tac ttc tac 
Gly Tyr Tyr Val Pro Phe Glu Trp Arg Thr Arg Glu Arg Tyr Phe Tyr 
165 170 175 

get tgg ggc tat aat gtg gtg gec atg acc ctg tgc tgt eta tec aac 
Ala Trp Gly Tyr Asn Val Val Ala Met Thr Leu Cys Cys Leu Ser Asn 
180 185 190 

ate eta ctg gac aca eta ggc tgt tat ttc atg ttc cac ate gec teg 
He Leu Leu Asp Thr Leu Gly Cys Tyr Phe Met Phe His He Ala Ser 
195 200 205 
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ctt ttc agg ctt ttg gga atg cga ctg gag gcc ttg aaa aat gca gcc 
Leu Phe Arg Leu Leu Gly Met Arg Leu Glu Ala Leu Lys Asn Ala Ala 
210 215 220 
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gaa gag aaa gcc aga ccg gag ttg cgc cgc att ttc caa ctg cac act 
Glu Glu Lys Ala Arg Pro Glu Leu Arg Arg He Phe Gin Leu His Thr 
225 230 235 240 

aaa gtc cgc cga ttg acg agg gaa tgc gaa gtg tta gtt tea ccc tat 
Lys Val Arg Arg Leu Thr Arg Glu Cys Glu Val Leu Val Ser Pro Tyr 
245 250 255 

gtt eta tec caa gtg gtc ttc agt gcc ttc ate ate tgc ttc agt gcc 
Val Leu Ser Gin Val Val Phe Ser Ala Phe He He Cys Phe Ser Ala 
260 265 270 

tat cga ctg gtg cac atg ggc ttc aag cag cga cct gga etc ttc gtg 
Tyr Arg Leu Val His Met Gly Phe Lys Gin Arg Pro Gly Leu Phe Val 
275 280 285 

acc acc gtg caa ttc gtg gcc gtc atg ate gtc cag att ttc ttg ccc 
Thr Thr Val Gin Phe Val Ala Val Met He Val Gin He Phe Leu Pro 
290 295 300 

tgt tac tac ggc aat gag ttg acc ttt cat gcc aat gca etc act aat 
Cys Tyr Tyr Gly Asn Glu Leu Thr Phe His Ala Asn Ala Leu Thr Asn 
305 310 315 320 

agt gtc ttc ggt acc aat tgg ctg gag tac tec gtg ggc act cgc aag 
Ser Val Phe Gly Thr Asn Trp Leu Glu Tyr Ser Val Gly Thr Arg Lys 
325 330 335 

ctg ctt aac tgc tac atg gag ttc etc aag cga ccg gtt aaa acc ate 
Leu Leu Asn Cys Tyr Met Glu Phe Leu Lys Arg Pro Val Lys Thr He 
340 345 350 

aac aat gcc tac agt ttc ttc gcc ctg ctg eta aag ata tec aag 
Asn Asn Ala Tyr Ser Phe Phe Ala Leu Leu Leu Lys He Ser Lys 
355 360 365 



<210> 62 
<211> 367 
<212> PRT 

<213> Drosophila melanogaster 
<400> 62 
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Met Glu Ser Thr Asn Arg Leu Ser Ala He Gin Thr Leu Leu Val He 
15 10 15 

Gin Arg Trp He Gly Leu Leu Lys Trp Glu Asn Glu Gly Glu Asp Gly 
20 25 30 

Val Leu Thr Trp Leu Lys Arg He Tyr Pro Phe Val Leu His Leu Pro 
35 40 45 

Leu Thr Phe Thr Tyr He Ala Leu Met Trp Tyr Glu Ala He Thr Ser 
50 55 60 

Ser Asp Phe Glu Glu Ala Gly Gin Val Leu Tyr Met Ser He Thr Glu 
65 70 75 80 

Leu Ala Leu Val Thr Lys Leu Leu Asn He Trp Tyr Arg Arg His Glu 



Ala Ala Ser Leu He His Glu Leu Gin His Asp Pro Ala Phe Asn Leu 
100 105 HO 

Arg Asn Ser Glu Glu He Lys Phe Trp Gin Gin Asn Gin Arg Asn Phe 
115 120 125 

Lys Arg He Phe Tyr Trp Tyr He Trp Gly Ser Leu Phe Val Ala Val 
130 135 140 

Met Gly Tyr He Ser Val Phe Phe Gin Glu Asp Tyr Glu Leu Pro Phe 
145 150 155 160 

Gly Tyr Tyr Val Pro Phe Glu Trp Arg Thr Arg Glu Arg Tyr Phe Tyr 
165 170 175 

Ala Trp Gly Tyr Asn Val Val Ala Met Thr Leu Cys Cys Leu Ser Asn 
180 185 190 

He Leu Leu Asp Thr Leu Gly Cys Tyr Phe Met Phe His He Ala Ser 
195 200 205 

Leu Phe Arg Leu Leu Gly Met Arg Leu Glu Ala Leu Lys Asn Ala Ala 
210 215 220 

Glu Glu Lys Ala Arg Pro Glu Leu Arg Arg He Phe Gin Leu His Thr 
225 230 235 240 

Lys Val Arg Arg Leu Thr Arg Glu Cys Glu Val Leu Val Ser Pro Tyr 
245 250 255 
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Val Leu Ser Gin 
260 

Tyr Arg Leu Val 
275 

Thr Thr Val Gin 
290 

Cys Tyr Tyr Gly 

305 

Ser Val Phe Gly 



Leu Leu Asn Cys 
340 

Asn Asn Ala Tyr 
355 



Val Val Phe Ser 



His Met Gly Phe 
280 

Phe Val Ala Val 
295 

Asn Glu Leu Thr 
310 

Thr Asn Trp Leu 
325 

Tyr Met Glu Phe 



Ser Phe Phe Ala 
360 



Ala Phe He He 
265 

Lys Gin Arg Pro 



Met He Val Gin 
300 

Phe His Ala Asn 

315 

Glu Tyr Ser Val 
330 

Leu Lys Arg Pro 
345 

Leu Leu Leu Lys 



Cys Phe Ser Ala 
270 

Gly Leu Phe Val 
285 

He Phe Leu Pro 



Ala Leu Thr Asn 
320 

Gly Thr Arg Lys 
335 

Val Lys Thr He 
350 

He Ser Lys 
365 



<210> 63 
<211> 1095 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1095) 

<22 3> DORLU 1.1 

<400> 63 

atg tgg etc ate gga tgg att ccg ccg 
Met Trp Leu He Gly Trp He Pro Pro 

1 ' 5 

gtg tat etc ttc tgg acc tgc gtg ccc 
Val Tyr Leu Phe Trp Thr Cys Val Pro 
20 25 

ctg ccc gtg ggc ttc ate ate age tac 
Leu Pro Val Gly Phe He He Ser Tyr 
35 40 

acg ccg ggc gag ttc ctt acc teg ctg 
Thr Pro Gly Glu Phe Leu Thr Ser Leu 



aag gag gga gtc ctg cgc tac 48 
Lys Glu Gly Val Leu Arg Tyr 
10 15 

ttc gec ttc ggg gtg ttt tac 96 
Phe Ala Phe Gly Val Phe Tyr 
30 

gtg cag gag ttc aag aac ttc 144 
Val Gin Glu Phe Lys Asn Phe 
45 

cag gtg tgc ate aat gtg tat 192 
Gin Val Cys He Asn Val Tyr 
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50 

ggc gcc teg gtg 
Gly Ala Ser Val 
65 

cgc aag acg <#ag 
Arg Lys Thr Glu 



gac age gat cgc 
Asp Ser Asp Arg 
100 

gcc ttt etc ate 
Ala Phe Leu lie 
115 

ttc ctg tec tac 
Phe Leu Ser Tyr 
130 

ccc ttc ate gat 
Pro Phe lie Asp 
145 

ata ttc gag tac 
lie Phe Glu Tyr 



tec gac acg tat 
Ser Asp Thr Tyr 

180 

gag gtc etc aag 
Glu Val Leu Lys 
195 

agt gag gca gac 
Ser Glu Ala Asp 
210 

aag act ata ctg 
Lys Thr lie Leu 
225 

acc ate ttc gtg 
Thr He Phe Val 



55 

aag tec acc 
Lys Ser Thr 
70 

ate ctt ctg 
He Leu Leu 
85 

gag agg ate 
Glu Arg He 



tac age ttc 
Tyr Ser Phe 



gcc etc agt 
Ala Leu Ser 
135 

tgg cgc gat 
Trp Arg Asp 
150 

ate acc atg 
He Thr Met 
165 

ccc ctg atg 
Pro Leu Met 



gat cac gtg 
Asp His Val 



aac tat cag 
Asn Tyr Gin 
215 

aaa tgc tgt 
Lys Cys Cys 
230 

caa ttc gcg 
Gin Phe Ala 



ate acc tac etc 
He Thr Tyr Leu 
75 

gac tec ctg gac 
Asp Ser Leu Asp 
90 

cac aat atg gtg 
His Asn Met Val 
105 

ate tac tgc gga 
He Tyr Cys Gly 
120 

ggt cgt cct ccg 
Gly Arg Pro Pro 



ggc atg ggc age 
Gly Met Gly Ser 
155 

tec ttc gcc gtg 
Ser Phe Ala Val 
170 

ttc acc att atg 
Phe Thr He Met 
185 

egg age ctg cgc 
Arg Ser Leu Arg 
200 

gat ctg gtg aac 
Asp Leu Val Asn 



gac atg att cgc 
Asp Met He Arg 
235 

ctg att ggt tec 
Leu He Gly Ser 



60 

ttc etc tgg 
Phe Leu Trp 



aag agg ctg 
Lys Arg Leu 



gcg cgc tgc 
Ala Arg Cys 
110 

tac gcg ggt 
Tyr Ala Gly 
125 

tgg tec gtc 
Trp Ser Val 
140 

ctg tgg ate 
Leu Trp He 



ctg cag gac 
Leu Gin Asp 



ttc egg gcc 
Phe Arg Ala 
190 

atg gat ccc 
Met Asp Pro 
205 

tgc gtg ctg 
Cys Val Leu 
220 

ccc atg ata 
Pro Met He 



gtt ttg ggc 
Val Leu Gly 



cga ctg 240 
Arg Leu 
80 

gcg aac 288 
Ala Asn 
95 

aac tac 336 
Asn Tyr 



tec act 384 
Ser Thr 



tac aat 432 
Tyr Asn 



cag gcc 480 
Gin Ala 
160 

cag eta 528 

Gin Leu 

175 

cac atg 576 
His Met 



gag cgc 62 4 
Glu Arg 



gac cac 672 
Asp His 



tec cgc 720 
Ser Arg 
240 

ctg acc 7 68 
Leu Thr 
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245 



250 



255 



ctg gtg aac gtg ttc ttc ttc teg aac ttc tgg aag ggc gtg gec teg 816 
Leu Val Asn Val Phe Phe Phe Ser Asn Phe Trp Lys Gly Val Ala Ser 
260 265 270 

etc ctg ttc gtc ate ace ate ctg ctg cag acc ttc ccg ttc tgc tac 864 
Leu Leu Phe Val lie Thr lie Leu Leu Gin Thr Phe Pro Phe Cys Tyr 
275 280 285 

acc tgc aac atg ctg ate gac gat gee cag gat ctg tec aac gag att 912 
Thr Cys Asn Met Leu lie Asp Asp Ala Gin Asp Leu Ser Asn Glu lie 
290 295 300 

ttc cag tec aac tgg gtg gac gcg gag ccg cgc tac aag gcg acg ctg 960 
Phe Gin Ser Asn Trp Val Asp Ala Glu Pro Arg Tyr Lys Ala Thr Leu 
305 310 315 320 

gtg etc ttc atg cac cat gtt cag cag ccc ata ate ttc att gec gga 1008 
Val Leu Phe Met His His Val Gin Gin Pro lie lie Phe lie Ala Gly 
325 330 335 

ggc ate ttt ccc ate tct atg aac age aac ata acc gta agg att act 1056 
Gly lie Phe Pro lie Ser Met Asn Ser Asn lie Thr Val Arg lie Thr 
340 345 350 

tct ttc ctg cca act gec tac ttc aca ttt gac cca ttt 1095 
Ser Phe Leu Pro Thr Ala Tyr Phe Thr Phe Asp Pro Phe 
355 360 365 



<210> 64 
<211> 365 
<212> PRT 

<213> Drosophila melanogaster 
<400> 64 

Met Trp Leu lie Gly Trp lie Pro Pro Lys Glu Gly Val Leu Arg Tyr 
15 10 15 

Val Tyr Leu Phe Trp Thr Cys Val Pro Phe Ala Phe Gly Val Phe Tyr 
20 25 30 

Leu Pro Val Gly Phe lie lie Ser Tyr Val Gin Glu Phe Lys Asn Phe 
35 40 45 

Thr Pro Gly Glu Phe Leu Thr Ser Leu Gin Val Cys lie Asn Val Tyr 
50 55 60 



128 



Gly Ala Ser Val Lys Ser Thr lie Thr Tyr Leu Phe Leu Trp Arg Leu 
65 70 75 80 



Arg Lys Thr Glu lie Leu Leu Asp Ser Leu Asp Lys Arg Leu Ala Asn 
85 90 95 

Asp Ser Asp Arg Glu Arg lie His Asn Met Val Ala Arg Cys Asn Tyr 
100 105 110 

Ala Phe Leu lie Tyr Ser Phe lie Tyr Cys Gly Tyr Ala Gly Ser Thr 
115 120 125 

Phe Leu Ser Tyr Ala Leu Ser Gly Arg Pro Pro Trp Ser Val Tyr Asn 
130 135 140 

Pro Phe lie Asp Trp Arg Asp Gly Met Gly Ser Leu Trp lie Gin Ala 
145 150 155 160 

lie Phe Glu Tyr lie Thr Met Ser Phe Ala Val Leu Gin Asp Gin Leu 
165 170 175 

Ser Asp Thr Tyr Pro Leu Met Phe Thr lie Met Phe Arg Ala His Met 
180 185 190 

Glu Val Leu Lys Asp His Val Arg Ser Leu Arg Met Asp Pro Glu Arg 
195 200 205 

Ser Glu Ala Asp Asn Tyr Gin Asp Leu Val Asn Cys Val Leu Asp His 
210 215 220 

Lys Thr lie Leu Lys Cys Cys Asp Met lie Arg Pro Met lie Ser Arg 
225 230 235 240 

Thr lie Phe Val Gin Phe Ala Leu lie Gly Ser Val Leu Gly Leu Thr 
245 250 255 

Leu Val Asn Val Phe Phe Phe Ser Asn Phe Trp Lys Gly Val Ala Ser 
260 265 270 

Leu Leu Phe Val lie Thr lie Leu Leu Gin Thr Phe Pro Phe Cys Tyr 
275 280 285 

Thr Cys Asn Met Leu lie Asp Asp Ala Gin Asp Leu Ser Asn Glu lie 
290 295 300 

Phe Gin Ser Asn Trp Val Asp Ala Glu Pro Arg Tyr Lys Ala Thr Leu 
305 310 315 320 
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Val Leu Phe Met His His Val Gin Gin Pro lie He Phe He Ala Gly 
325 330 335 



Gly He Phe Pro He Ser Met Asn Ser Asn He Thr Val Arg He Thr 

340 345 350 

Ser Phe Leu Pro Thr Ala Tyr Phe Thr Phe Asp Pro Phe 
355 360 365 



<210> 65 
<211> 1233 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1233) 

<223> DORLU 2.1 

<400> 65 

atg acc aag ttc ttc ttc aag cgc ctg caa act get cca ctt gat cag 
Met Thr Lys Phe Phe Phe Lys Arg Leu Gin Thr Ala Pro Leu Asp Gin 
15 10 15 

gag gtg agt tec ctt gat gec age gac tac tac tac cgc ate gca ttt 
Glu Val Ser Ser Leu Asp Ala Ser Asp Tyr Tyr Tyr Arg He Ala Phe 



ttc ctg ggc tgg acc ccg ccc aag ggg get ctg etc cga tgg ate tac 

Phe Leu Gly Trp Thr Pro Pro Lys Gly Ala Leu Leu Arg Trp He Tyr 

35 40 45 

tec ctg tgg act ctg acc acg atg tgg ctg ggt ate gtg tac ctg ccg 

Ser Leu Trp Thr Leu Thr Thr Met Trp Leu Gly He Val Tyr Leu Pro 

50 55 60 

etc gga ctg age etc acc tat gtg aag cac ttc gat aga ttc acg ccg 

Leu Gly Leu Ser Leu Thr Tyr Val Lys His Phe Asp Arg Phe Thr Pro 



acg gag ttc ctg acc tec ctg cag gtg gat ate aac tgc ate ggg aac 
Thr Glu Phe Leu Thr Ser Leu Gin Val Asp He Asn Cys He Gly Asn 



gtg ate aag tea tgc gta act tat tec cag atg tgg cgt ttt cgc egg 
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Val lie Lys Ser Cys Val Thr Tyr Ser Gin Met Trp Arg Phe Arg Arg 
100 105 110 

atg aat gag ctt ate teg tec ctg gac aag aga tgt gtg act acg aca 384 
Met Asn Glu Leu lie Ser Ser Leu Asp Lys Arg Cys Val Thr Thr Thr 
115 120 125 

cag cgt cga att ttc cat aag atg gtg gca egg gtt aat etc ate gtg 432 
Gin Arg Arg lie Phe His Lys Met Val Ala Arg Val Asn Leu lie Val 
130 135 140 

att ctg ttc ttg tec acg tac ttg ggc ttc tgc ttt eta act ctg ttc 480 
lie Leu Phe Leu Ser Thr Tyr Leu Gly Phe Cys Phe Leu Thr Leu Phe 
145 150 155 160 

act teg gtt ttc get ggc aaa get cct tgg cag ctg tac aac cca ctg 528 
Thr Ser Val Phe Ala Gly Lys Ala Pro Trp Gin Leu Tyr Asn Pro Leu 
165 170 175 

gtg gac tgg egg aaa ggc cat tgg cag eta tgg att gec tec ate ctg 576 
Val Asp Trp Arg Lys Gly His Trp Gin Leu Trp lie Ala Ser lie Leu 
180 185 190 

gag tac tgt gtg gtc tec att ggc acc atg cag gag ttg atg tec gac 624 
Glu Tyr Cys Val Val Ser lie Gly Thr Met Gin Glu Leu Met Ser Asp 
195 200 205 

acc tac gec ata gtg ttc ate tec ttg ttc cgc tgc cac ctg get att 672 
Thr Tyr Ala lie Val Phe He Ser Leu Phe Arg Cys His Leu Ala He 
210 215 220 

etc aga gat cgc ata get aat ctg egg cag gat ccg aaa etc agt gag 720 
Leu Arg Asp Arg He Ala Asn Leu Arg Gin Asp Pro Lys Leu Ser Glu 
225 230 235 240 

atg gaa cac tat gag cag atg gtg gee tgc att cag gat cat cga acc 768 
Met Glu His Tyr Glu Gin Met Val Ala Cys He Gin Asp His Arg Thr 
245 250 255 

ate ata cag tgc tec cag att att cga ccc ate ctg teg ate act ate 816 
He He Gin Cys Ser Gin He He Arg Pro He Leu Ser He Thr He 
260 265 270 

ttt gec cag ttc atg ctg gtt ggc att gac ttg ggt ctg gcg gec ate 864 
Phe Ala Gin Phe Met Leu Val Gly He Asp Leu Gly Leu Ala Ala He 
275 280 285 

age ate etc ttc ttt ccg aac acc att tgg acg ate atg gca aac gtg 912 
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Ser He Leu Phe Phe Pro Asn Thr He Trp Thr He Met Ala Asn Val 
290 295 300 



teg ttc ate gtg gec ate tgt aca gag 
Ser Phe He Val Ala He Cys Thr Glu 
305 310 

tgc gag cat ctg ate gag gac tec gtc 
Cys Glu His Leu He Glu Asp Ser Val 
325 

cac tea aac tgg ata acc gcg gac agg 
His Ser Asn Trp He Thr Ala Asp Arg 
340 345 

tat ttc ctg cac egg get cag caa ccc 
Tyr Phe Leu His Arg Ala Gin Gin Pro 
355 360 

ata ttt ccc att teg gtg cag age aac 
He Phe Pro He Ser Val Gin Ser Asn 
370 375 

ttc aca ate ate aca ate gtg aac caa 
Phe Thr He He Thr He Val Asn Gin 
385 390 

ttc agt gac agg age aat ggc gat ata 
Phe Ser Asp Arg Ser Asn Gly Asp He 
405 



tec ttt cca tgc tgc atg etc 960 
Ser Phe Pro Cys Cys Met Leu 
315 320 

cat gtg age aac gec ctg ttc 1008 
His Val Ser Asn Ala Leu Phe 
330 335 

age tac aag teg gcg gtt ctg 1056 
Ser Tyr Lys Ser Ala Val Leu 
350 

att caa ttc acg gec ggc tec 1104 
He Gin Phe Thr Ala Gly Ser 
365 

ata gec gtg gec aag ttc gcg 1152 
He Ala Val Ala Lys Phe Ala 
380 

atg aat ctg ggc gag aag ttc 1200 
Met Asn Leu Gly Glu Lys Phe 
395 400 

aat cct 1233 

Asn Pro 

410 



<210> 66 
<211> 411 
<212> PRT 

<213> Drosophila melanogaster 
<400> 66 

Met Thr Lys Phe Phe Phe Lys Arg Leu Gin Thr Ala Pro Leu Asp Gin 
15 10 15 

Glu Val Ser Ser Leu Asp Ala Ser Asp Tyr Tyr Tyr Arg He Ala Phe 
20 25 30 

Phe Leu Gly Trp Thr Pro Pro Lys Gly Ala Leu Leu Arg Trp He Tyr 
35 40 45 

Ser Leu Trp Thr Leu Thr Thr Met Trp Leu Gly He Val Tyr Leu Pro 
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50 



55 



60 



Leu Gly Leu Ser 

65 

Thr Glu Phe Leu 



Val He Lys Ser 
100 

Met Asn Glu Leu 
115 

Gin Arg Arg He 
130 

He Leu Phe Leu 
145 

Thr Ser Val Phe 



Val Asp Trp Arg 
180 

Glu Tyr Cys Val 
195 

Thr Tyr Ala He 
210 

Leu Arg Asp Arg 
225 

Met Glu His Tyr 



He He Gin Cys 
260 

Phe Ala Gin Phe 
275 

Ser He Leu Phe 
290 

Ser Phe He Val 



Leu Thr Tyr Val 
70 

Thr Ser Leu Gin 
85 

Cys Val Thr Tyr 



He Ser Ser Leu 
120 

Phe His Lys Met 
135 

Ser Thr Tyr Leu 
150 

Ala Gly Lys Ala 
165 

Lys Gly His Trp 



Val Ser He Gly 
200 

Val Phe He Ser 
215 

He Ala Asn Leu 
230 

Glu Gin Met Val 
245 

Ser Gin He He 



Met Leu Val Gly 
280 

Phe Pro Asn Thr 
295 

Ala He Cys Thr 



Lys His Phe Asp 
75 

Val Asp He Asn 
90 

Ser Gin Met Trp 
105 

Asp Lys Arg Cys 



Val Ala Arg Val 
140 

Gly Phe Cys Phe 
155 

Pro Trp Gin Leu 
170 

Gin Leu Trp He 
185 

Thr Met Gin Glu 



Leu Phe Arg Cys 
220 

Arg Gin Asp Pro 
235 

Ala Cys He Gin 
250 

Arg Pro He Leu 
265 

He Asp Leu Gly 



He Trp Thr He 
300 

Glu Ser Phe Pro 



Arg Phe Thr Pro 
80 

Cys He Gly Asn 
95 

Arg Phe Arg Arg 
110 

Val Thr Thr Thr 
125 

Asn Leu He Val 



Leu Thr Leu Phe 
160 

Tyr Asn Pro Leu 
175 

Ala Ser He Leu 
190 

Leu Met Ser Asp 
205 

His Leu Ala lie 



Lys Leu Ser Glu 
240 

Asp His Arg Thr 
255 

Ser He Thr He 
270 

Leu Ala Ala He 
285 

Met Ala Asn Val 



Cys Cys Met Leu 
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305 

Cys Glu His Leu 



His Ser Asn Trp 
340 

Tyr Phe Leu His 
355 

lie Phe Pro He 
370 

Phe Thr He He 
385 

Phe Ser Asp Arg 



310 

He Glu Asp Ser 
325 

He Thr Ala Asp 



Arg Ala Gin Gin 
360 

Ser Val Gin Ser 
375 

Thr He Val Asn 
390 

Ser Asn Gly Asp 
405 



315 

Val His Val Ser 
330 

Arg Ser Tyr Lys 
345 

Pro lie Gin Phe 



Asn He Ala Val 
380 

Gin Met Asn Leu 
395 

He Asn Pro 
410 



320 

Asn Ala Leu Phe 
335 

Ser Ala Val Leu 
350 

Thr Ala Gly Ser 
365 

Ala Lys Phe Ala 



Gly Glu Lys Phe 
400 



<210> 67 
<211> 1191 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1191) 
<223> DORLU 4.1 

<400> 67 

atg att ttt aag tac att caa gag cca gtc ctt gga tec tta ttt cga 4 8 

Met He Phe Lys Tyr lie Gin Glu Pro Val Leu Gly Ser Leu Phe Arg 

15 10 15 

tec egg gat teg ctg ate tac tta aac aga tec ata gat caa atg gga 96 
Ser Arg Asp Ser Leu lie Tyr Leu Asn Arg Ser He Asp Gin Met Gly 
20 25 30 

tgg aga ctg ccg cca cga act aag ccg tac tgg tgg etc tat tac att 144 
Trp Arg Leu Pro Pro Arg Thr Lys Pro Tyr Trp Trp Leu Tyr Tyr He 
35 40 45 

tgg aca ttg gtg gtc ata gta etc gtc ttt ate ttt ata ccc tat gga 192 
Trp Thr Leu Val Val He Val Leu Val Phe He Phe He Pro Tyr Gly 
50 55 60 
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ctg ata atg act gga ata aag gag 
Leu lie Met Thr Gly lie Lys Glu 
65 70 

ctg ttt acg tat gtc cag gtg ccg 
Leu Phe Thr Tyr Val Gin Val Pro 
85 



ttc aag aac ttc acg acc acg gat 240 
Phe Lys Asn Phe Thr Thr Thr Asp 
75 80 

gtt aac acc aat get teg ate atg 288 
Val Asn Thr Asn Ala Ser lie Met 
90 95 



aag ggc att ata gtg ttg ttt atg egg egg cga ttt tea agg get cag 336 
Lys Gly lie lie Val Leu Phe Met Arg Arg Arg Phe Ser Arg Ala Gin 
100 105 110 



aag atg atg gac gee atg gac att cga tgc acc aag atg gag gag aaa 384 
Lys Met Met Asp Ala Met Asp lie Arg Cys Thr Lys Met Glu Glu Lys 
115 120 125 



gtc cag gtg cac cga gca gca gec tta tgc aat cgt gtt gtt gtg att 432 
Val Gin Val His Arg Ala Ala Ala Leu Cys Asn Arg Val Val Val lie 
130 135 140 



tac cat tgc ata tac ttc ggc tat eta tec atg gee tta acc gga get 480 
Tyr His Cys lie Tyr Phe Gly Tyr Leu Ser Met Ala Leu Thr Gly Ala 
145 150 155 160 



ctg gtg att ggg aag act cca ttc tgt ttg tac aat cca ctg gtt aac 528 
Leu Val lie Gly Lys Thr Pro Phe Cys Leu Tyr Asn Pro Leu Val Asn 
165 170 175 



ccc gac gat cat ttc tat ctg gec act gec att gaa teg gtc acc atg 576 
Pro Asp Asp His Phe Tyr Leu Ala Thr Ala lie Glu Ser Val Thr Met 
180 185 190 



get ggc att att ctg gee aat etc att ttg gac gta tat ccc ate ata 624 
Ala Gly lie lie Leu Ala Asn Leu lie Leu Asp Val Tyr Pro lie lie 
195 200 205 



tat gtg gtc gtt ctg egg ate cac atg gag etc ttg agt gag cga ate 672 
Tyr Val Val Val Leu Arg lie His Met Glu Leu Leu Ser Glu Arg lie 
. 210 215 220 



aag acg ctg cgt act gat gtg gaa aaa ggc gac gat caa cat tat gec 720 
Lys Thr Leu Arg Thr Asp Val Glu Lys Gly Asp Asp Gin His Tyr Ala 
225 230 235 240 



gag ctg gtg gag tgt gta aag gat cac aag eta att gtc gaa tat gga 768 
Glu Leu Val Glu Cys Val Lys Asp His Lys Leu lie Val Glu Tyr Gly 
245 250 255 
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aac act ctg cgt ccc atg ata tec gec acg atg ttc ate caa eta eta 816 
Asn Thr Leu Arg Pro Met lie Ser Ala Thr Met Phe lie Gin Leu Leu 
260 265 270 

tec gtt ggc tta ctt ttg ggt ctg gca gcg gtg tec atg cag ttc tat 864 
Ser Val Gly Leu Leu Leu Gly Leu Ala Ala Val Ser Met Gin Phe Tyr 
275 280 285 

aac acc gta atg gag cgt gtt gtc tec ggg gtc tac ace ata gec att 912 
Asn Thr Val Met Glu Arg Val Val Ser Gly Val Tyr Thr lie Ala lie 
290 295 300 

eta tec cag acc ttt cca ttt tgc tat gtc tgt gag cag ctg age age 960 
Leu Ser Gin Thr Phe Pro Phe Cys Tyr Val Cys Glu Gin Leu Ser Ser 
305 310 315 320 

gat tgc gaa tec ctg acc aac aca ctg ttc cat tec aag tgg att gga 1008 
Asp Cys Glu Ser Leu Thr Asn Thr Leu Phe His Ser Lys Trp lie Gly 
325 330 335 

get gag cga cga tac aga acc acg atg ttg tac ttc att cac aat gtt 1056 
Ala Glu Arg Arg Tyr Arg Thr Thr Met Leu Tyr Phe lie His Asn Val 
340 345 350 

cag cag teg att ttg ttc act gcg ggc gga att ttc ccc ata tgt eta 1104 
Gin Gin Ser lie Leu Phe Thr Ala Gly Gly lie Phe Pro He Cys Leu 
355 360 365 

aac acc aat ata aag atg gee aag ttc get ttc tea gtg gtg acc att 1152 
Asn Thr Asn He Lys Met Ala Lys Phe Ala Phe Ser Val Val Thr He 
370 375 380 

gta aat gag atg gac ttg gec gag aaa ttg aga agg gag 1191 
Val Asn Glu Met Asp Leu Ala Glu Lys Leu Arg Arg Glu 
385 390 395 



<210> 68 
<211> 397 
<212> PRT 

<213> Drosophila melanogaster 
<400> 68 

Met He Phe Lys Tyr He Gin Glu Pro Val Leu Gly Ser Leu Phe Arg 
15 10 15 

Ser Arg Asp Ser Leu He Tyr Leu Asn Arg Ser He Asp Gin Met Gly 
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Trp Arg Leu Pro Pro Arg Thr Lys Pro Tyr Trp Trp Leu Tyr Tyr lie 
35 40 45 

Trp Thr Leu Val Val lie Val Leu Val Phe lie Phe He Pro Tyr Gly 



Leu He Met Thr Gly He Lys Glu Phe Lys Asn Phe Thr Thr Thr Asp 
65 70 75 80 

Leu Phe Thr Tyr Val Gin Val Pro Val Asn Thr Asn Ala Ser He Met 



Lys Gly He He Val Leu Phe Met Arg Arg Arg Phe Ser Arg Ala Gin 
100 105 110 

Lys Met Met Asp Ala Met Asp He Arg Cys Thr Lys Met Glu Glu Lys 
115 120 125 

Val Gin Val His Arg Ala Ala Ala Leu Cys Asn Arg Val Val Val He 
130 135 140 

Tyr His Cys He Tyr Phe Gly Tyr Leu Ser Met Ala Leu Thr Gly Ala 
145 150 155 160 

Leu Val He Gly Lys Thr Pro Phe Cys Leu Tyr Asn Pro Leu Val Asn 
165 170 175 

Pro Asp Asp His Phe Tyr Leu Ala Thr Ala He Glu Ser Val Thr Met 
180 185 190 

Ala Gly He He Leu Ala Asn Leu He Leu Asp Val Tyr Pro He lie 
195 200 205 

Tyr Val Val Val Leu Arg He His Met Glu Leu Leu Ser Glu Arg He 
210 215 220 

Lys Thr Leu Arg Thr Asp Val Glu Lys Gly Asp Asp Gin His Tyr Ala 
225 230 235 240 

Glu Leu Val Glu Cys Val Lys Asp His Lys Leu He Val Glu Tyr Gly 
245 250 255 

Asn Thr Leu Arg Pro Met He Ser Ala Thr Met Phe He Gin Leu Leu 
260 265 270 

Ser Val Gly Leu Leu Leu Gly Leu Ala Ala Val Ser Met Gin Phe Tyr 
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275 



280 



285 



Asn Thr Val Met 
290 

Leu Ser Gin Thr 

305 

Asp Cys Glu Ser 



Ala Glu Arg Arg 
340 

Gin Gin Ser lie 
355 

Asn Thr Asn lie 
370 

Val Asn Glu Met 
385 



Glu Arg Val Val 
295 

Phe Pro Phe Cys 
310 

Leu Thr Asn Thr 
325 

Tyr Arg Thr Thr 



Leu Phe Thr Ala 
360 

Lys Met Ala Lys 
375 

Asp Leu Ala Glu 
390 



Ser Gly Val Tyr 
300 

Tyr Val Cys Glu 
315 

Leu Phe His Ser 
330 

Met Leu Tyr Phe 
345 

Gly Gly lie Phe 



Phe Ala Phe Ser 

380 

Lys Leu Arg Arg 
395 



Thr He Ala He 



Gin Leu Ser Ser 
320 

Lys Trp He Gly 
335 

He His Asn Val 
350 

Pro He Cys Leu 
365 

Val Val Thr He 



Glu 



<210> 69 
<211> 1191 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1191) 

<223> DORLU 5.1 

<400> 69 

atg ttg ttc aac tat ctg cga aag 
Met Leu Phe Asn Tyr Leu Arg Lys 
1 5 

tct ccg gac tea ttt aga tac ttt 
Ser Pro Asp Ser Phe Arg Tyr Phe 
20 

tgg cac aca cca gca acg cat aag 
Trp His Thr Pro Ala Thr His Lys 
35 40 



ccg aat ccg aca aac ctt ttg act 48 
Pro Asn Pro Thr Asn Leu Leu Thr 
10 15 

gag tat gga atg ttt tgc atg gga 96 
Glu Tyr Gly Met Phe Cys Met Gly 
25 30 

ata ate tac tat ata aca tec tgt 144 
He He Tyr Tyr He Thr Ser Cys 
45 
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ttg att ttt get tgg tgt gec gta tac ttg cca ate gga ate ate att 
Leu lie Phe Ala Trp Cys Ala Val Tyr Leu Pro lie Gly lie lie lie 
50 55 60 

agt ttc aaa acg gat att aac aca ttc aca ccg aat gaa ctg ttg aca 
Ser Phe Lys Thr Asp He Asn Thr Phe Thr Pro Asn Glu Leu Leu Thr 



gtt atg caa tta ttt ttc aat tea gtg gga atg cca ttc aag gtt ctg 
Val Met Gin Leu Phe Phe Asn Ser Val Gly Met Pro Phe Lys Val Leu 



ttc ttc aat ttg tat att tct gga ttt tac aag gee aaa aag etc ctt 

Phe Phe Asn Leu Tyr He Ser Gly Phe Tyr Lys Ala Lys Lys Leu Leu 

100 105 110 

age gaa atg gac aaa cgt tgc ace act ttg aag gag cga gtg gaa gtg 

Ser Glu Met Asp Lys Arg Cys Thr Thr Leu Lys Glu Arg Val Glu Val 
115 120 125 

cac caa ggt gtg gtc cgt tgc aac aag gee tac etc att tac cag ttc 

His Gin Gly Val Val Arg Cys Asn Lys Ala Tyr Leu He Tyr Gin Phe 
130 135 140 

att tat acc gcg tac act att tea aca ttt eta teg gcg get ctt agt 

He Tyr Thr Ala Tyr Thr He Ser Thr Phe Leu Ser Ala Ala Leu Ser 

145 150 155 160 

gga aaa ttg cca tgg cgc ate tat aat cct ttt gtg gat ttt cga gaa 

Gly Lys Leu Pro Trp Arg lie Tyr Asn Pro Phe Val Asp Phe Arg Glu 

165 170 175 

agt aga tec agt ttt tgg aaa get gee etc aac gag aca gca ctt atg 

Ser Arg Ser Ser Phe Trp Lys Ala Ala Leu Asn Glu Thr Ala Leu Met 

180 185 190 

eta ttt get gtg act caa acc eta atg agt gat ata tat cca ctg ctt 

Leu Phe Ala Val Thr Gin Thr Leu Met Ser Asp He Tyr Pro Leu Leu 
195 200 205 

tat ggt ttg ate ctg aga gtt cac etc aaa ctt ttg cga eta aga gtg 

Tyr Gly Leu He Leu Arg Val His Leu Lys Leu Leu Arg Leu Arg Val 
210 215 220 

gag age ctg tgc aca gat tct gga aaa age gat get gaa aac gag caa 

Glu Ser Leu Cys Thr Asp Ser Gly Lys Ser Asp Ala Glu Asn Glu Gin 

225 230 235 240 
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gat ttg att aag tgc ate aag gat cac aat etc att att gac tat get 
Asp Leu lie Lys Cys lie Lys Asp His Asn Leu lie lie Asp Tyr Ala 
245 250 255 

gca gca ata cga cca gcg gtt acc cgc aca att ttc gtt caa ttc etc 
Ala Ala lie Arg Pro Ala Val Thr Arg Thr He Phe Val Gin Phe Leu 
260 265 270 

ttg ate gga att tgc ctt ggc ctt tea atg ate aat eta etc ttc ttt 
Leu He Gly He Cys Leu Gly Leu Ser Met He Asn Leu Leu Phe Phe 
275 280 285 

gec gac ate tgg aca gga ttg gec aca gtg get tac ate aat ggt eta 
Ala Asp He Trp Thr Gly Leu Ala Thr Val Ala Tyr He Asn Gly Leu 
290 295 300 

atg gtg cag aca ttt cca ttt tgc ttc gtt tgt gat eta etc aaa aag 
Met Val Gin Thr Phe Pro Phe Cys Phe Val Cys Asp Leu Leu Lys Lys 
305 310 315 320 

gat tgt gaa ctt ctt gtg teg gee ata ttt cat tec aac tgg att aat 
Asp Cys Glu Leu Leu Val Ser Ala He Phe His Ser Asn Trp He Asn 
325 330 335 

tea age cgc agt tac aag tea tct ttg aga tat ttt ctg aag aac gee 
Ser Ser Arg Ser Tyr Lys Ser Ser Leu Arg Tyr Phe Leu Lys Asn Ala 
340 345 350 

cag aaa tea att get ttt aca gee ggc tct att ttt ccc att tct act 
Gin Lys Ser He Ala Phe Thr Ala Gly Ser He Phe Pro He Ser Thr 
355 360 365 

ggc teg aat att aag gtg get aag ctg gca ttt teg gtg gtt act ttt 
Gly Ser Asn He Lys Val Ala Lys Leu Ala Phe Ser Val Val Thr Phe 
370 375 380 

gtc aat caa ctt aac ata get gac aga ttg aca aag aac 
Val Asn Gin Leu Asn lie Ala Asp Arg Leu Thr Lys Asn 
385 390 395 



<210> 70 
<211> 397 
<212> PRT 

<213> Drosophila melanogaster 
<400> 70 

Met Leu Phe Asn Tyr Leu Arg Lys Pro Asn Pro Thr Asn Leu Leu Thr 
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Ser Pro Asp Ser Phe Arg Tyr Phe Glu Tyr Gly Met Phe Cys Met Gly 
20 25 30 

Trp His Thr Pro Ala Thr His Lys lie He Tyr Tyr He Thr Ser Cys 
35 40 45 

Leu He Phe Ala Trp Cys Ala Val Tyr Leu Pro He Gly He He He 



Ser Phe Lys Thr Asp He Asn Thr Phe Thr Pro Asn Glu Leu Leu Thr 



Val Met Gin Leu Phe Phe Asn Ser Val Gly Met Pro Phe Lys Val Leu 



Phe Phe Asn Leu Tyr He Ser Gly Phe Tyr Lys Ala Lys Lys Leu Leu 
100 105 110 

Ser Glu Met Asp Lys Arg Cys Thr Thr Leu Lys Glu Arg Val Glu Val 
115 120 125 

His Gin Gly Val Val Arg Cys Asn Lys Ala Tyr Leu He Tyr Gin Phe 
130 135 140 

He Tyr Thr Ala Tyr Thr He Ser Thr Phe Leu Ser Ala Ala Leu Ser 
145 150 155 160 

Gly Lys Leu Pro Trp Arg He Tyr Asn Pro Phe Val Asp Phe Arg Glu 
165 170 175 

Ser Arg Ser Ser Phe Trp Lys Ala Ala Leu Asn Glu Thr Ala Leu Met 
180 185 190 

Leu Phe Ala Val Thr Gin Thr Leu Met Ser Asp He Tyr Pro Leu Leu 
195 200 205 

Tyr Gly Leu He Leu Arg Val His Leu Lys Leu Leu Arg Leu Arg Val 
210 215 220 

Glu Ser Leu Cys Thr Asp Ser Gly Lys Ser Asp Ala Glu Asn Glu Gin 
225 230 235 240 

Asp Leu He Lys Cys He Lys Asp His Asn Leu He He Asp Tyr Ala 
245 250 255 

Ala Ala He Arg Pro Ala Val Thr Arg Thr lie Phe Val Gin Phe Leu 
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260 



265 



270 



Leu lie Gly lie 
275 

Ala Asp lie Trp 
290 

Met Val Gin Thr 

305 

Asp Cys Glu Leu 



Ser Ser Arg Ser 
340 

Gin Lys Ser lie 
355 

Gly Ser Asn lie 
370 

Val Asn Gin Leu 
385 



Cys Leu Gly Leu 
280 

Thr Gly Leu Ala 
295 

Phe Pro Phe Cys 
310 

Leu Val Ser Ala 
325 

Tyr Lys Ser Ser 



Ala Phe Thr Ala 
360 

Lys Val Ala Lys 
375 

Asn lie Ala Asp 
390 



Ser Met lie Asn 



Thr Val Ala Tyr 
300 

Phe Val Cys Asp 
315 

lie Phe His Ser 
330 

Leu Arg Tyr Phe 
345 

Gly Ser He Phe 



Leu Ala Phe Ser 
380 

Arg Leu Thr Lys 
395 



Leu Leu Phe Phe 
285 

He Asn Gly Leu 



Leu Leu Lys Lys 
320 

Asn Trp He Asn 
335 

Leu Lys Asn Ala 
350 

Pro He Ser Thr 
365 

Val Val Thr Phe 



Asn 



<210> 71 
<211> 1239 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1239) 
<223> DORLU 6.1 

<400> 71 

atg gcg gtg age act cgt gtg gec 
Met Ala Val Ser Thr Arg Val Ala 
1 5 

egg cga gcg ttt agg aat etc ttc 
Arg Arg Ala Phe Arg Asn Leu Phe 
20 

cag gca ccg gat ggc agt cga ccg 



aca aag cag gaa gtg ccc gaa tec 4 8 
Thr Lys Gin Glu Val Pro Glu Ser 
10 15 

aat tgc ttc tat gec ctt ggc atg 96 
Asn Cys Phe Tyr Ala Leu Gly Met 
25 30 

acc acg age age aca tgg caa cgc 14 4 
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Gin Ala Pro Asp Gly Ser Arg Pro Thr Thr Ser Ser Thr Trp Gin Arg 
35 40 45 



ate tac gec tgc ttc teg gtg gtc atg tac gtg tgg caa ctg ctg ctg 192 
He Tyr Ala Cys Phe Ser Val Val Met Tyr Val Trp Gin Leu Leu Leu 
50 55 60 



gtg ccc aca ttc ttt gtg ate age tat egg tac atg ggc ggc atg gag 240 
Val Pro Thr Phe Phe Val He Ser Tyr Arg Tyr Met Gly Gly Met Glu 
65 70 75 80 



att ace cag gtg ctg acc tec gec cag gtg gee ate gat gcg gtc att 
He Thr Gin Val Leu Thr Ser Ala Gin Val Ala He Asp Ala Val He 



ctg ccg gec aag att gtg gca ctg gcg tgg aat ttg cca ttg ctg cgc 336 
Leu Pro Ala Lys He Val Ala Leu Ala Trp Asn Leu Pro Leu Leu Arg 
100 105 110 



aga gca gag cat cat ctg gec gec ttg gat gcg egg tgc agg gaa cag 38 4 
Arg Ala Glu His His Leu Ala Ala Leu Asp Ala Arg Cys Arg Glu Gin 
115 120 125 



gag gag ttc caa ttg ate 
Glu Glu Phe Gin Leu He 
130 



etc gat gcg gtg agg 
Leu Asp Ala Val Arg 
135 



ttt tgc aac tat ctg 432 

Phe Cys Asn Tyr Leu 
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gta tgg ttc tac cag ate tgc tat gec ate tac tec teg teg aca ttt 480 
Val Trp Phe Tyr Gin He Cys Tyr Ala He Tyr Ser Ser Ser Thr Phe 
145 150 155 160 



gtg tgc gec ttc ctg ctg ggc caa ccg cca tat gee etc tat ttg cct 528 
Val Cys Ala Phe Leu Leu Gly Gin Pro Pro Tyr Ala Leu Tyr Leu Pro 
165 170 ' 175 



ggc etc gat tgg cag cgt tec cag atg cag ttc tgc ate cag gec tgg 57 6 
Gly Leu Asp Trp Gin Arg Ser Gin Met Gin Phe Cys He Gin Ala Trp 
180 185 190 



att gag ttc ctt ate atg aac tgg acg tgc ctg cac caa get age gat 624 
He Glu Phe Leu lie Met Asn Trp Thr Cys Leu His Gin Ala Ser Asp 
195 200 205 



gat gtg tac gee gtt ate tat ctg tat gtg gtc egg att caa gtg caa 672 
Asp Val Tyr Ala Val He Tyr Leu Tyr Val Val Arg He Gin Val Gin 
210 215 220 



ttg ctg gee agg egg gtg gag aag ctg ggc acg gat gat agt ggc cag 720 
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Leu Leu Ala Arg Arg Val Glu Lys Leu Gly Thr Asp Asp Ser Gly Gin 
225 230 235 240 

gtg gag ate tat ccc gat gag egg egg cag gag gag cat tgc gcg gaa 
Val Glu He Tyr Pro Asp Glu Arg Arg Gin Glu Glu His Cys Ala Glu 
245 250 255 

ctg cag cgc tgc att gta gat cac cag acg atg ctg cag ctg etc gac 
Leu Gin Arg Cys He Val Asp His Gin Thr Met Leu Gin Leu Leu Asp 
260 265 270 

tgc att agt ccc gtc ate teg cgt acc ata ttc gtt cag ttc ctg ate 
Cys He Ser Pro Val He Ser Arg Thr He Phe Val Gin Phe Leu He 
275 280 285 

acc gee gee ate atg ggc acc acc atg ate aac att ttc att ttc gee 
Thr Ala Ala He Met Gly Thr Thr Met He Asn He Phe He Phe Ala 
290 295 300 

aat acg aac acg aag ate gca teg ate att tac ctg ctg gcg gtg acc 
Asn Thr Asn Thr Lys He Ala Ser He He Tyr Leu Leu Ala Val Thr 
305 310 315 320 

ctg cag acg get cca tgt tgc tat cag gee acc teg ctg atg ttg gac 
Leu Gin Thr Ala Pro Cys Cys Tyr Gin Ala Thr Ser Leu Met Leu Asp 
325 330 335 

aac gag agg ctg gee ctg gec ate ttc cag tgc cag tgg ctg ggc cag 
Asn Glu Arg Leu Ala Leu Ala He Phe Gin Cys Gin Trp Leu Gly Gin 
340 345 350 

agt gee egg ttc cgt aag atg ctg etc tac tat ctt cat cgc gee cag 
Ser Ala Arg Phe Arg Lys Met Leu Leu Tyr Tyr Leu His Arg Ala Gin 
355 360 365 

cag ccc ate acg ctg acc gee atg aag ctg ttt ccc ate aat ctg gee 
Gin Pro He Thr Leu Thr Ala Met Lys Leu Phe Pro He Asn Leu Ala 
370 375 380 

acg tac ttc agt ata gec aag ttc teg ttt teg etc tac acg etc ate 
Thr Tyr Phe Ser He Ala Lys Phe Ser Phe Ser Leu Tyr Thr Leu He 
385 390 395 400 

aag ggg atg aat etc ggc gag cga ttc aac agg aca aat 
Lys Gly Met Asn Leu Gly Glu Arg Phe Asn Arg Thr Asn 
405 410 



<210> 72 
<211> 413 
<212> PRT 

<213> Drosophila melanogaster 
<400> 72 

Met Ala Val Ser Thr Arg Val Ala Thr Lys Gin Glu Val Pro Glu Ser 



Arg Arg Ala Phe Arg Asn Leu Phe Asn Cys Phe Tyr Ala Leu Gly Met 
20 25 30 

Gin Ala Pro Asp Gly Ser Arg Pro Thr Thr Ser Ser Thr Trp Gin Arg 



lie Tyr Ala Cys Phe Ser Val Val Met Tyr Val Trp Gin Leu Leu Leu 



Val Pro Thr Phe Phe Val He Ser Tyr Arg Tyr Met Gly Gly Met Glu 



He Thr Gin Val Leu Thr Ser Ala Gin Val Ala He Asp Ala Val He 



Leu Pro Ala Lys He Val Ala Leu Ala Trp Asn Leu Pro Leu Leu Arg 
100 105 110 

Arg Ala Glu His His Leu Ala Ala Leu Asp Ala Arg Cys Arg Glu Gin 
115 120 125 

Glu Glu Phe Gin Leu He Leu Asp Ala Val Arg Phe Cys Asn Tyr Leu 
130 135 140 

Val Trp Phe Tyr Gin He Cys Tyr Ala lie Tyr Ser Ser Ser Thr Phe 
145 150 155 160 

Val Cys Ala Phe Leu Leu Gly Gin Pro Pro Tyr Ala Leu Tyr Leu Pro 
165 170 175 

Gly Leu Asp Trp Gin Arg Ser Gin Met Gin Phe Cys He Gin Ala Trp 
180 185 190 

He Glu Phe Leu He Met Asn Trp Thr Cys Leu His Gin Ala Ser Asp 
195 200 205 

Asp Val Tyr Ala Val He Tyr Leu Tyr Val Val Arg He Gin Val Gin 
210 215 220 
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Leu Leu Ala Arg Arg Val Glu Lys Leu Gly Thr Asp Asp Ser Gly Gin 
225 230 235 240 



Val Glu lie Tyr Pro Asp Glu Arg Arg Gin Glu Glu His Cys Ala Glu 
245 250 255 

Leu Gin Arg Cys lie Val Asp His Gin Thr Met Leu Gin Leu Leu Asp 
260 265 270 

Cys lie Ser Pro Val lie Ser Arg Thr lie Phe Val Gin Phe Leu lie 
275 280 285 

Thr Ala Ala lie Met Gly Thr Thr Met He Asn He Phe He Phe Ala 
290 295 300 

Asn Thr Asn Thr Lys He Ala Ser He He Tyr Leu Leu Ala Val Thr 
305 310 315 320 

Leu Gin Thr Ala Pro Cys Cys Tyr Gin Ala Thr Ser Leu Met Leu Asp 
325 330 335 

Asn Glu Arg Leu Ala Leu Ala He Phe Gin Cys Gin Trp Leu Gly Gin 
340 345 350 

Ser Ala Arg Phe Arg Lys Met Leu Leu Tyr Tyr Leu His Arg Ala Gin 
355 360 365 

Gin Pro He Thr Leu Thr Ala Met Lys Leu Phe Pro lie Asn Leu Ala 

370 375 380 

Thr Tyr Phe Ser He Ala Lys Phe Ser Phe Ser Leu Tyr Thr Leu He 
385 390 395 400 

Lys Gly Met Asn Leu Gly Glu Arg Phe Asn Arg Thr Asn 
405 410 



<210> 73 
<211> 1089 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1089) 
<223> DORLU 7.1 
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<400> 73 

atg gac tac gat cga att cga ccg gtg cga ttt ttg acg gga gtg ctg 48 
Met Asp Tyr Asp Arg lie Arg Pro Val Arg Phe Leu Thr Gly Val Leu 
15 10 15 



aaa tgg tgg cgt etc tgg ccg agg aag gaa teg gtg tec aca ccg gac 96 
Lys Trp Trp Arg Leu Trp Pro Arg Lys Glu Ser Val Ser Thr Pro Asp 
20 25 30 



tgg act aac tgg cag gca tat gec ttg cac gtt cca ttt aca ttc ttg 144 
Trp Thr Asn Trp Gin Ala Tyr Ala Leu His Val Pro Phe Thr Phe Leu 
35 40 45 



ttt gtg ttg ctt ttg tgg ttg gag gca ate aag age agg gat ata cag 192 
Phe Val Leu Leu Leu Trp Leu Glu Ala lie Lys Ser Arg Asp lie Gin 



cat acc gee gat gtc ctt ttg att tgc eta ace acc act gee ttg gga 240 
His Thr Ala Asp Val Leu Leu lie Cys Leu Thr Thr Thr Ala Leu Gly 
65 70 75 80 



ggt aaa gtt ate aat ate tgg aag tat gee cat gtg gee caa ggc att 288 
Gly Lys Val lie Asn lie Trp Lys Tyr Ala His Val Ala Gin Gly lie 
85 90 95 



ttg tec gag tgg age acg tgg gat ctt ttc gag ctg agg age aaa cag 336 
Leu Ser Glu Trp Ser Thr Trp Asp Leu Phe Glu Leu Arg Ser Lys Gin 
100 105 110 



gaa gtg gat atg tgg cga ttc gag cat cga cgt ttc aat cgt gtt ttt 384 
Glu Val Asp Met Trp Arg Phe Glu His Arg Arg Phe Asn Arg Val Phe 
115 120 125 



atg ttt tac tgt ttg tgc agt get ggt gta ate cca ttt att gtg att 432 
Met Phe Tyr Cys Leu Cys Ser Ala Gly Val lie Pro Phe lie Val lie 
130 135 140 



caa ccg ttg ttt gat ate cca aat cga ttg ccc ttc tgg atg tgg aca 480 
Gin Pro Leu Phe Asp lie Pro Asn Arg Leu Pro Phe Trp Met Trp Thr 
145 150 155 160 



cca ttc gat tgg cag cag cct gtt etc tta tgg tat gca ttc ate tat 528 
Pro Phe Asp Trp Gin Gin Pro Val Leu Leu Trp Tyr Ala Phe lie Tyr 
165 170 175 



cag gec aca acc att cct att gec tgt get tgc aac gta acc atg gac 57 6 
Gin Ala Thr Thr lie Pro He Ala Cys Ala Cys Asn Val Thr Met Asp 

180 185 190 
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get gtt aat tgg tac ttg atg ctg cat ctg tec ttg tgt ttg cgt atg 
Ala Val Asn Trp Tyr Leu Met Leu His Leu Ser Leu Cys Leu Arg Met 
195 200 205 

ttg ggc cag cga ttg agt aag ctt cag cat gat gac aag gat ctg agg 
Leu Gly Gin Arg Leu Ser Lys Leu Gin His Asp Asp Lys Asp Leu Arg 
210 215 220 

gag aag ttc ctg gaa ctg ate cat ctg cac cag cga etc aag caa cag 
Glu Lys Phe Leu Glu Leu lie His Leu His Gin Arg Leu Lys Gin Gin 
225 230 235 240 

gec ttg age att gaa ate ttt att teg aag age acg ttc acc caa att 
Ala Leu Ser lie Glu He Phe He Ser Lys Ser Thr Phe Thr Gin He 
245 250 255 

ctg gtc agt tec ctt ate att tgc ttc acc att tac age atg cag atg 
Leu Val Ser Ser Leu He He Cys Phe Thr He Tyr Ser Met Gin Met 
260 265 270 

tac eta gtg gec atg ate atg cag gtc atg ctg ccc acc ata tat ggt 
Tyr Leu Val Ala Met He Met Gin Val Met Leu Pro Thr He Tyr Gly 
275 280 285 

aac gec gtc ate gat tct gca aat atg ttg acc gat tec atg tac aat 
Asn Ala Val He Asp Ser Ala Asn Met Leu Thr Asp Ser Met Tyr Asn 
290 295 300 

teg gat tgg ccg gat atg aat tgc cga atg cgt cgc eta gtt tta atg 
Ser Asp Trp Pro Asp Met Asn Cys Arg Met Arg Arg Leu Val Leu Met 
305 310 315 320 

ttt atg gtg tac tta aat cga ccg gtg acc tta aaa gee ggt ggc ttt 
Phe Met Val Tyr Leu Asn Arg Pro Val Thr Leu Lys Ala Gly Gly Phe 
325 330 335 

ttt cat att ggt tta cct ctg ttt acc aag acc atg aat caa gca tac 
Phe His He Gly Leu Pro Leu Phe Thr Lys Thr Met Asn Gin Ala Tyr 
340 345 350 

agt ttg ctg gee ttg ctg etc aac atg aac caa 
Ser Leu Leu Ala Leu Leu Leu Asn Met Asn Gin 
355 360 



<210> 74 
<211> 363 



<212> PRT 

<213> Drosophila melanogaster 



<400> 74 

Met Asp Tyr Asp Arg lie Arg Pro Val Arg Phe Leu Thr Gly Val Leu 



Lys Trp Trp Arg Leu Trp Pro Arg Lys Glu Ser Val Ser Thr Pro Asp 
20 25 30 

Trp Thr Asn Trp Gin Ala Tyr Ala Leu His Val Pro Phe Thr Phe Leu 



Phe Val Leu Leu Leu Trp Leu Glu Ala lie Lys Ser Arg Asp lie Gin 
50 55 60 

His Thr Ala Asp Val Leu Leu lie Cys Leu Thr Thr Thr Ala Leu Gly 
65 70 75 80 

Gly Lys Val lie Asn lie Trp Lys Tyr Ala His Val Ala Gin Gly lie 
85 90 95 

Leu Ser Glu Trp Ser Thr Trp Asp Leu Phe Glu Leu Arg Ser Lys Gin 
100 105 110 

Glu Val Asp Met Trp Arg Phe Glu His Arg Arg Phe Asn Arg Val Phe 
115 120 125 

Met Phe Tyr Cys Leu Cys Ser Ala Gly Val lie Pro Phe lie Val lie 
130 135 140 

Gin Pro Leu Phe Asp lie Pro Asn Arg Leu Pro Phe Trp Met Trp Thr 
145 150 155 160 

Pro Phe Asp Trp Gin Gin Pro Val Leu Leu Trp Tyr Ala Phe lie Tyr 
165 170 175 

Gin Ala Thr Thr lie Pro lie Ala Cys Ala Cys Asn Val Thr Met Asp 
180 185 190 

Ala Val Asn Trp Tyr Leu Met Leu His Leu Ser Leu Cys Leu Arg Met 
195 200 205 

Leu Gly Gin Arg Leu Ser Lys Leu Gin His Asp Asp Lys Asp Leu Arg 
210 215 220 

Glu Lys Phe Leu Glu Leu lie His Leu His Gin Arg Leu Lys Gin Gin 
225 230 235 240 
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Ala Leu Ser lie 



Leu Val Ser Ser 
260 

Tyr Leu Val Ala 
275 

Asn Ala Val lie 
290 

Ser Asp Trp Pro 
305 

Phe Met Val Tyr 



Phe His lie Gly 
340 

Ser Leu Leu Ala 
355 



Glu He Phe He 
245 

Leu He He Cys 



Met He Met Gin 
280 

Asp Ser Ala Asn 
295 

Asp Met Asn Cys 
310 

Leu Asn Arg Pro 
325 

Leu Pro Leu Phe 



Leu Leu Leu Asn 
360 



Ser Lys Ser Thr 
250 

Phe Thr He Tyr 
265 

Val Met Leu Pro 



Met Leu Thr Asp 
300 

Arg Met Arg Arg 
315 

Val Thr Leu Lys 
330 

Thr Lys Thr Met 
345 

Met Asn Gin 



Phe Thr Gin He 
255 

Ser Met Gin Met 
270 

Thr He Tyr Gly 
285 

Ser Met Tyr Asn 



Leu Val Leu Met 
320 

Ala Gly Gly Phe 
335 

Asn Gin Ala Tyr 
350 



<210> 75 
<211> 1176 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1176) 
<223> DORLU 9.1 

<400> 75 

atg age gac aag gtg aag gga aaa aag cag gag gaa aag gat caa tec 48 
Met Ser Asp Lys Val Lys Gly Lys Lys Gin Glu Glu Lys Asp Gin Ser 

15 10 15 

ttg egg gtg caa att etc gtt tat cgc tgc atg ggc ate gat ttg tgg 96 
Leu Arg Val Gin He Leu Val Tyr Arg Cys Met Gly He Asp Leu Trp 
20 25 30 

age ccc acg atg gcg aat gac cgc ccg tgg ctg acc ttt gtc aca atg 14 4 
Ser Pro Thr Met Ala Asn Asp Arg Pro Trp Leu Thr Phe Val Thr Met 
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gga cca ctt ttc ctg ttt atg gtg ccc atg ttc ctg gcc gcc cac gag 
Gly Pro Leu Phe Leu Phe Met Val Pro Met Phe Leu Ala Ala His Glu 



tac ate acc cag gtg age ctg etc tec gac ace ctg ggc tec acc ttc 
Tyr lie Thr Gin Val Ser Leu Leu Ser Asp Thr Leu Gly Ser Thr Phe 
65 70 75 80 

gcc age atg etc acc ctg gtc aaa ttc ctg etc ttc tgc tat cat cgc 
Ala Ser Met Leu Thr Leu Val Lys Phe Leu Leu Phe Cys Tyr His Arg 
85 90 95 

aag gag ttc gtc ggc ctg ate tac cac ate agg gcc att ctg get aaa 
Lys Glu Phe Val Gly Leu lie Tyr His lie Arg Ala lie Leu Ala Lys 
100 105 110 

gaa ate gaa gtg tgg cct gat gcg egg gaa ate ate gag gtg gag aac 
Glu lie Glu Val Trp Pro Asp Ala Arg Glu lie He Glu Val Glu Asn 
115 120 125 

caa agt gac caa atg etc agt ctt acg tac act cgc tgt ttt gga ctg 
Gin Ser Asp Gin Met Leu Ser Leu Thr Tyr Thr Arg Cys Phe Gly Leu 
130 135 140 

get gga ate ttt gcg gcc ctg aag ccc ttt gtg ggc ate ata etc tec 
Ala Gly He Phe Ala Ala Leu Lys Pro Phe Val Gly He He Leu Ser 
145 150 155 160 

teg att cgc ggc gac gag att cac ctg gag ctg ccc cac aac ggc gtt 
Ser He Arg Gly Asp Glu He His Leu Glu Leu Pro His Asn Gly Val 
165 170 175 

tac ccg tac gat etc cag gtg gtc atg ttt tat gtg ccc acc tat ctg 
Tyr Pro Tyr Asp Leu Gin Val Val Met Phe Tyr Val Pro Thr Tyr Leu 
180 185 190 

tgg aat gtg atg gcc age tat agt get gta acc atg gca etc tgc gtg 
Trp Asn Val Met Ala Ser Tyr Ser Ala Val Thr Met Ala Leu Cys Val 
195 200 205 

gac teg ctg etc ttc ttt ttc acc tac aac gtg tgc gcc att ttc aag 
Asp Ser Leu Leu Phe Phe Phe Thr Tyr Asn Val Cys Ala He Phe Lys 
210 215 220 

ate gcc aag cac egg atg ate cat ctg ccg gcg gtg ggc gga aag gag 
He Ala Lys His Arg Met He His Leu Pro Ala Val Gly Gly Lys Glu 



gag ctg gag ggg etc gtc cag gtg ctg ctg ctg cac cag aag ggc etc 
Glu Leu Glu Gly Leu Val Gin Val Leu Leu Leu His Gin Lys Gly Leu 
245 250 255 

cag ate gec gat cac att gcg gac aag tac egg ccg ctg ate ttt ttg 
Gin lie Ala Asp His lie Ala Asp Lys Tyr Arg Pro Leu lie Phe Leu 
260 265 270 

cag ttc ttt ctg tec gec ttg cag ate tgc ttc att gga ttc cag gtg 
Gin Phe Phe Leu Ser Ala Leu Gin He Cys Phe He Gly Phe Gin Val 
275 280 285 

get gat ctg ttt ccc aat ccg cag agt etc tac ttt ate gee ttt gtg 
Ala Asp Leu Phe Pro Asn Pro Gin Ser Leu Tyr Phe He Ala Phe Val 
290 295 300 

ggc teg ctg etc ate gca ctg ttc ate tac teg aag tgc ggc gaa aat 
Gly Ser Leu Leu He Ala Leu Phe lie Tyr Ser Lys Cys Gly Glu Asn 
305 310 315 320 

ate aag agt gec age ctg gat ttc gga aac ggg ctg tac gag ace aac 
He Lys Ser Ala Ser Leu Asp Phe Gly Asn Gly Leu Tyr Glu Thr Asn 
325 330 335 

tgg acc gac ttc teg cca ccc act aaa aga gec etc etc att gec gec 
Trp Thr Asp Phe Ser Pro Pro Thr Lys Arg Ala Leu Leu lie Ala Ala 
340 345 350 

atg cgc gec cag cga cct tgc cag atg aag ggc tac ttt ttc gag gec 
Met Arg Ala Gin Arg Pro Cys Gin Met Lys Gly Tyr Phe Phe Glu Ala 
355 360 365 

age atg gec acc ttc teg acg att gtt cgc tct gee gtg teg tac ate 
Ser Met Ala Thr Phe Ser Thr He Val Arg Ser Ala Val Ser Tyr He 
370 375 380 

atg atg ttg cgc tec ttt aat gec 
Met Met Leu Arg Ser Phe Asn Ala 
385 390 



<210> 76 
<211> 392 
<212> PRT 
<213> Drosophila 
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<400> 76 

Met Ser Asp Lys Val Lys Gly Lys Lys Gin Glu Glu Lys Asp Gin Ser 
15 10 15 



Leu Arg Val Gin He Leu Val Tyr Arg Cys Met Gly He Asp Leu Trp 
20 25 30 

Ser Pro Thr Met Ala Asn Asp Arg Pro Trp Leu Thr Phe Val Thr Met 



Gly Pro Leu Phe Leu Phe Met Val Pro Met Phe Leu Ala Ala His Glu 



Tyr He Thr Gin Val Ser Leu Leu Ser Asp Thr Leu Gly Ser Thr Phe 



Ala Ser Met Leu Thr Leu Val Lys Phe Leu Leu Phe Cys Tyr His Arg 



Lys Glu Phe Val Gly Leu He Tyr His He Arg Ala He Leu Ala Lys 
100 105 110 

Glu He Glu Val Trp Pro Asp Ala Arg Glu He lie Glu Val Glu Asn 
115 120 125 

Gin Ser Asp Gin Met Leu Ser Leu Thr Tyr Thr Arg Cys Phe Gly Leu 
130 135 140 

Ala Gly He Phe Ala Ala Leu Lys Pro Phe Val Gly lie lie Leu Ser 
145 • 150 155 160 

Ser lie Arg Gly Asp Glu lie His Leu Glu Leu Pro His Asn Gly Val 
165 170 175 

Tyr Pro Tyr Asp Leu Gin Val Val Met Phe Tyr Val Pro Thr Tyr Leu 
180 185 190 

Trp Asn Val Met Ala Ser Tyr Ser Ala Val Thr Met Ala Leu Cys Val 
195 200 205 

Asp Ser Leu Leu Phe Phe Phe Thr Tyr Asn Val Cys Ala lie Phe Lys 
210 215 220 

He Ala Lys His Arg Met lie His Leu Pro Ala Val Gly Gly Lys Glu 
225 230 235 240 

Glu Leu Glu Gly Leu Val Gin Val Leu Leu Leu His Gin Lys Gly Leu 
245 250 255 
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Gin lie Ala Asp His lie Ala Asp Lys Tyr Arg Pro Leu lie Phe Leu 
260 265 270 



Gin Phe Phe Leu Ser Ala Leu Gin 
275 280 

Ala Asp Leu Phe Pro Asn Pro Gin 
290 295 

Gly Ser Leu Leu lie Ala Leu Phe 
305 310 

lie Lys Ser Ala Ser Leu Asp Phe 
325 

Trp Thr Asp Phe Ser Pro Pro Thr 
340 

Met Arg Ala Gin Arg Pro Cys Gin 

355 360 

Ser Met Ala Thr Phe Ser Thr He 
370 375 

Met Met Leu Arg Ser Phe Asn Ala 
385 390 



He Cys Phe He Gly Phe Gin Val 
285 

Ser Leu Tyr Phe He Ala Phe Val 
300 

He Tyr Ser Lys Cys Gly Glu Asn 

315 320 

Gly Asn Gly Leu Tyr Glu Thr Asn 
330 335 

Lys Arg Ala Leu Leu He Ala Ala 
345 350 

Met Lys Gly Tyr Phe Phe Glu Ala 
365 

Val Arg Ser Ala Val Ser Tyr He 
380 



<210> 77 
<211> 1221 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1221) 
<223> DORLU 12.1 

<400> 77 

atg gat aac gtc gcg gaa atg cct 
Met Asp Asn Val Ala Glu Met Pro 
1 5 

gat ttt ttg agg eta get gtg aaa 
Asp Phe Leu Arg Leu Ala Val Lys 
20 



gaa gaa aag tat gtc gaa gtc gat 4 8 
Glu Glu Lys Tyr Val Glu Val Asp 
10 15 

ttc tac aat act ttg ggc att gat 96 
Phe Tyr Asn Thr Leu Gly He Asp 
25 30 
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ccc tat gaa act gga cga aaa cga act att tgg ttt caa ata tat ttc 
Pro Tyr Glu Thr Gly Arg Lys Arg Thr He Trp Phe Gin He Tyr Phe 



gca ttg aat atg ttt aat atg gtg ttt agt ttt tat gcc gag gta gcg 

Ala Leu Asn Met Phe Asn Met Val Phe Ser Phe Tyr Ala Glu Val Ala 

50 55 60 

act ctg gtg gac agg tta cgc gat aat gaa aat ttt etc gag age tgc 

Thr Leu Val Asp Arg Leu Arg Asp Asn Glu Asn Phe Leu Glu Ser Cys 



ate tta ctg age tac gtg tec ttt gtg gtc atg ggc etc tec aag ata 
He Leu Leu Ser Tyr Val Ser Phe Val Val Met Gly Leu Ser Lys He 



ggt get gta atg aaa aaa aag cca aaa atg aca get ttg gtc agg caa 
Gly Ala Val Met Lys Lys Lys Pro Lys Met Thr Ala Leu Val Arg Gin 
100 105 110 

ttg gag acc tgc ttt ccg teg cca agt gca aag gtt caa gag gaa tat 
Leu Glu Thr Cys Phe Pro Ser Pro Ser Ala Lys Val Gin Glu Glu Tyr 
115 120 125 

get gtg aag tec tgg ctg aaa cgc tgc cat ata tac aca aag gga ttt 
Ala Val Lys Ser Trp Leu Lys Arg Cys His He Tyr Thr Lys Gly Phe 
130 135 140 

ggt ggt etc ttc atg ate atg tat ttc get cac get ctg att ccc tta 
Gly Gly Leu Phe Met He Met Tyr Phe Ala His Ala Leu He Pro Leu 
145 150 155 160 

ttc ata tac ttc att caa aga gtg ctg etc cac tat ccg gat gcc aag 
Phe He Tyr Phe He Gin Arg Val Leu Leu His Tyr Pro Asp Ala Lys 
165 170 175 

cag att atg ccg ttt tac caa etc gaa cct tgg gaa ttt cgc gac tec 
Gin He Met Pro Phe Tyr Gin Leu Glu Pro Trp Glu Phe Arg Asp Ser 
180 185 190 

tgg ttg ttt tat cca age tat ttt cac cag teg teg gcc gga tat acg 
Trp Leu Phe Tyr Pro Ser Tyr Phe His Gin Ser Ser Ala Gly Tyr Thr 
195 200 205 

get aca tgt gga tec att gcc ggt gac eta atg ate ttc get gtg gtc 
Ala Thr Cys Gly Ser He Ala Gly Asp Leu Met He Phe Ala Val Val 
210 215 220 
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ctg cag gtc ate atg cac tac gaa aga ctg gec aag gtt ctt agg gag 
Leu Gin Val lie Met His Tyr Glu Arg Leu Ala Lys Val Leu Arg Glu 
225 230 235 240 
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ttt aag att caa gec cat aac gca ccc aat gga get aag gag gat ata 7 68 
Phe Lys lie Gin Ala His Asn Ala Pro Asn Gly Ala Lys Glu Asp lie 
245 250 255 

agg aag ttg cag tec eta gtc gee aat cac att gat ata ctt cga etc 816 
Arg Lys Leu Gin Ser Leu Val Ala Asn His lie Asp lie Leu Arg Leu 
260 265 270 

act gat ctg atg aac gag gtc ttt gga att ccc ttg ttg eta aac ttt 8 64 
Thr Asp Leu Met Asn Glu Val Phe Gly lie Pro Leu Leu Leu Asn Phe 
275 280 285 

att gca tct gcg ctg ctg gtc tgc ctg gtg gga gtt caa tta acc ate 912 
lie Ala Ser Ala Leu Leu Val Cys Leu Val Gly Val Gin Leu Thr lie 
290 295 300 

get tta agt cca gag tat ttt tgc aag cag atg eta ttt ctg att tec 960 
Ala Leu Ser Pro Glu Tyr Phe Cys Lys Gin Met Leu Phe Leu lie Ser 
305 310 315 320 

gta ctg ctt gag gtc tat etc ctt tgc tec ttc age cag agg tta ata 1008 
Val Leu Leu Glu Val Tyr Leu Leu Cys Ser Phe Ser Gin Arg Leu lie 
325 330 335 

gat get age gaa aac gtg ggc cat gcg gca tac gat atg gat tgg tta 1056 
Asp Ala Ser Glu Asn Val Gly His Ala Ala Tyr Asp Met Asp Trp Leu 
340 345 350 

ggt tec gac aaa cga ttc aag aaa att tta att ttt ata tct atg cga 1104 
Gly Ser Asp Lys Arg Phe Lys Lys lie Leu lie Phe lie Ser Met Arg 
355 360 365 

tec cag aag cca gtt tgc ctt aaa gec aca gtt gtc ttg gac tta tec 1152 
Ser Gin Lys Pro Val Cys Leu Lys Ala Thr Val Val Leu Asp Leu Ser 
370 375 380 

atg cca act atg age ate ttt ctt ggt atg teg tat aag ttt ttc tgc 1200 
Met Pro Thr Met Ser lie Phe Leu Gly Met Ser Tyr Lys Phe Phe Cys 
385 390 395 400 

get gtg agg act atg tat caa 1221 
Ala Val Arg Thr Met Tyr Gin 
405 
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<210> 78 
<211> 407 
<212> PRT 

<213> Drosophila melanogaster 
<400> 78 

Met Asp Asn Val Ala Glu Met Pro Glu Glu Lys Tyr Val Glu Val Asp 
15 10 15 

Asp Phe Leu Arg Leu Ala Val Lys Phe Tyr Asn Thr Leu Gly lie Asp 
20 25 30 

Pro Tyr Glu Thr Gly Arg Lys Arg Thr lie Trp Phe Gin lie Tyr Phe 
35 40 45 

Ala Leu Asn Met Phe Asn Met Val Phe Ser Phe Tyr Ala Glu Val Ala 
50 55 60 

Thr Leu Val Asp Arg Leu Arg Asp Asn Glu Asn Phe Leu Glu Ser Cys 
65 70 75 80 

lie Leu Leu Ser Tyr Val Ser Phe Val Val Met Gly Leu Ser Lys lie 
85 90 95 

Gly Ala Val Met Lys Lys Lys Pro Lys Met Thr Ala Leu Val Arg Gin 
100 105 110 

Leu Glu Thr Cys Phe Pro Ser Pro Ser Ala Lys Val Gin Glu Glu Tyr 
115 120 125 

Ala Val Lys Ser Trp Leu Lys Arg Cys His lie Tyr Thr Lys Gly Phe 
130 135 140 

Gly Gly Leu Phe Met lie Met Tyr Phe Ala His Ala Leu lie Pro Leu 
145 150 155 160 

Phe lie Tyr Phe lie Gin Arg Val Leu Leu His Tyr Pro Asp Ala Lys 
165 170 175 

Gin lie Met Pro Phe Tyr Gin Leu Glu Pro Trp Glu Phe Arg Asp Ser 
180 185 190 

Trp Leu Phe Tyr Pro Ser Tyr Phe His Gin Ser Ser Ala Gly Tyr Thr 
195 200 205 

Ala Thr Cys Gly Ser lie Ala Gly Asp Leu Met He Phe Ala Val Val 
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210 



215 



220 



Leu Gin Val lie Met His Tyr Glu Arg Leu Ala Lys Val Leu Arg Glu 
225 230 235 240 

Phe Lys lie Gin Ala His Asn Ala Pro Asn Gly Ala Lys Glu Asp lie 
245 250 255 

Arg Lys Leu Gin Ser Leu Val Ala Asn His lie Asp lie Leu Arg Leu 
260 265 270 

Thr Asp Leu Met Asn Glu Val Phe Gly lie Pro Leu Leu Leu Asn Phe 
275 280 285 

He Ala Ser Ala Leu Leu Val Cys Leu Val Gly Val Gin Leu Thr He 
290 295 300 

Ala Leu Ser Pro Glu Tyr Phe Cys Lys Gin Met Leu Phe Leu He Ser 
305 310 315 320 

Val Leu Leu Glu Val Tyr Leu Leu Cys Ser Phe Ser Gin Arg Leu He 
325 330 335 

Asp Ala Ser Glu Asn Val Gly His Ala Ala Tyr Asp Met Asp Trp Leu 
340 345 350 

Gly Ser Asp Lys Arg Phe Lys Lys He Leu He Phe He Ser Met Arg 
355 360 365 

Ser Gin Lys Pro Val Cys Leu Lys Ala Thr Val Val Leu Asp Leu Ser 
370 375 380 

Met Pro Thr Met Ser He Phe Leu Gly Met Ser Tyr Lys Phe Phe Cys 
385 390 395 400 

Ala Val Arg Thr Met Tyr Gin 
405 



<210> 79 
<211> 1212 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1212) 
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<223> DORLU 13.1 



<400> 79 

atg gaa aca gcg aag gat aat aca gcc agg act ttt atg gaa ttg atg 48 
Met Glu Thr Ala Lys Asp Asn Thr Ala Arg Thr Phe Met Glu Leu Met 
15 10 15 



cga gtg cca gta cag ttt tac aga acg att gga gag gat ate tac gcc 96 
Arg Val Pro Val Gin Phe Tyr Arg Thr lie Gly Glu Asp lie Tyr Ala 
20 25 30 



cat cga tec acg aat ccc eta aaa teg ctt etc ttc aag ate tat eta 144 
His Arg Ser Thr Asn Pro Leu Lys Ser Leu Leu Phe Lys lie Tyr Leu 
35 40 45 



tat gcg gga ttc ata aat ttt aat ctg ttg gta ate ggt gaa ctg gtg 192 
Tyr Ala Gly Phe He Asn Phe Asn Leu Leu Val He Gly Glu Leu Val 
50 55 60 



ttc ttc tac aac tea att cag gac ttt gaa acc att cga ttg gcc ate 240 
Phe Phe Tyr Asn Ser He Gin Asp Phe Glu Thr He Arg Leu Ala He 
65 70 75 80 



gcg gtg get cca tgt ate gga ttt tct ctg gtt get gat ttt aaa caa 288 
Ala Val Ala Pro Cys He Gly Phe Ser Leu Val Ala Asp Phe Lys Gin 
85 90 95 



get gcc atg att aga ggc aag aaa aca eta att atg eta etc gat gat 336 
Ala Ala Met lie Arg Gly Lys Lys Thr Leu He Met Leu Leu Asp Asp 
100 105 110 



ttg gag aac atg cat ccg aaa acc ctg gca aag caa atg gaa tac aaa 384 
Leu Glu Asn Met His Pro Lys Thr Leu Ala Lys Gin Met Glu Tyr Lys 
115 120 125 



ttg ccg gac ttt gaa aag acc atg aaa cgt gtg ate aat ata ttc acc 432 
Leu Pro Asp Phe Glu Lys Thr Met Lys Arg Val He Asn He Phe Thr 
130 135 140 



ttt etc tgc ttg gcc tat acg act acg ttc tec ttt tat ccg gcc ate 480 
Phe Leu Cys Leu Ala Tyr Thr Thr Thr Phe Ser Phe Tyr Pro Ala He 
145 150 155 160 



aag gca tec gtg aaa ttt aat ttc ttg ggc tac gac acc ttt gat cga 528 
Lys Ala Ser Val Lys Phe Asn Phe Leu Gly Tyr Asp Thr Phe Asp Arg 
165 170 175 



aat ttt ggt ttc etc ate tgg ttt ccc ttc gat gca aca agg aat aat 576 
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Asn Phe Gly Phe Leu He Trp Phe Pro Phe Asp Ala Thr Arg Asn Asn 
180 185 190 

ttg ata tac tgg ate atg tac tgg gac ata gec cat ggg gec tat eta 
Leu lie Tyr Trp lie Met Tyr Trp Asp lie Ala His Gly Ala Tyr Leu 
195 200 205 

gcg ggt att get ttt etc tgc gec gat ctt ttg etc gtc gta gtc att 
Ala Gly He Ala Phe Leu Cys Ala Asp Leu Leu Leu Val Val Val He 
210 215 220 

ace cag att tgt atg cac ttt aac tat ata tct atg cga tta gag gat 
Thr Gin He Cys Met His Phe Asn Tyr He Ser Met Arg Leu Glu Asp 
225 230 235 240 

cat cca tgt aat teg aat gag gac aaa gag aat ata gag ttt ctt att 
His Pro Cys Asn Ser Asn Glu Asp Lys Glu Asn He Glu Phe Leu He 
245 250 255 

ggc att ate aga tac cat gac aag tgc ctt aaa eta tgc gaa cat gtc 
Gly He He Arg Tyr His Asp Lys Cys Leu Lys Leu Cys Glu His Val 
260 265 270 

aac gat ctg tat agt ttc tct ttg ctg ctt aat ttc ctt atg gca tec 
Asn Asp Leu Tyr Ser Phe Ser Leu Leu Leu Asn Phe Leu Met Ala Ser 
275 280 285 

atg cag att tgt ttc ata gee ttt cag gtc acc gaa tea aca gtg gaa 
Met Gin He Cys Phe He Ala Phe Gin Val Thr Glu Ser Thr Val Glu 
290 295 300 

gtg att att att tac tgc att ttt ttg atg acc teg atg gtt cag gta 
Val He He He Tyr Cys He Phe Leu Met Thr Ser Met Val Gin Val 
305 310 315 320 

ttt atg gtg tgc tac tat ggg gat act tta att gec gcg age ttg aaa 
Phe Met Val Cys Tyr Tyr Gly Asp Thr Leu He Ala Ala Ser Leu Lys 
325 330 335 

gtg ggc gat gec get tac aac caa aag tgg ttt cag tgc age aaa tec 
Val Gly Asp Ala Ala Tyr Asn Gin Lys Trp Phe Gin Cys Ser Lys Ser 
340 345 350 

tat tgc acc atg ttg aag ttg eta ate atg agg agt cag aaa cca get 
Tyr Cys Thr Met Leu Lys Leu Leu He Met Arg Ser Gin Lys Pro Ala 
355 360 365 

tea ata aga ccg ccg act ttt ccc ccc ata tec ttg gtt acc tat atg 

160 



in 



Ser lie Arg Pro Pro Thr Phe Pro Pro lie Ser Leu Val Thr Tyr Met 
370 375 380 



aag gtc ate age atg teg tat caa ttt ttt gee tta ctt aga acc aca 1200 
Lys Val lie Ser Met Ser Tyr Gin Phe Phe Ala Leu Leu Arg Thr Thr 
385 390 395 400 



tac age aat aat 
Tyr Ser Asn Asn 



<210> 80 
<211> 404 
<212> PRT 

<213> Drosophila melanogaster 
<400> 80 

Met Glu Thr Ala Lys Asp Asn Thr Ala Arg Thr Phe Met Glu Leu Met 
15 10 15 

Arg Val Pro Val Gin Phe Tyr Arg Thr lie Gly Glu Asp lie Tyr Ala 
20 25 30 

His Arg Ser Thr Asn Pro Leu Lys Ser Leu Leu Phe Lys lie Tyr Leu 
35 40 45 

Tyr Ala Gly Phe lie Asn Phe Asn Leu Leu Val lie Gly Glu Leu Val 
50 55 60 

Phe Phe Tyr Asn Ser lie Gin Asp Phe Glu Thr lie Arg Leu Ala lie 

65 70 75 80 

Ala Val Ala Pro Cys lie Gly Phe Ser Leu Val Ala Asp Phe Lys Gin 
85 90 95 

Ala Ala Met lie Arg Gly Lys Lys Thr Leu lie Met Leu Leu Asp Asp 
100 105 110 

Leu Glu Asn Met His Pro Lys Thr Leu Ala Lys Gin Met Glu Tyr Lys 
115 120 125 

Leu Pro Asp Phe Glu Lys Thr Met Lys Arg Val lie Asn lie Phe Thr 
130 135 140 

Phe Leu Cys Leu Ala Tyr Thr Thr Thr Phe Ser Phe Tyr Pro Ala lie 
145 150 155 160 

Lys Ala Ser Val Lys Phe Asn Phe Leu Gly Tyr Asp Thr Phe Asp Arg 

161 



165 



170 



175 



Asn Phe Gly Phe Leu lie Trp Phe Pro Phe Asp Ala Thr Arg Asn Asn 
180 185 190 

Leu lie Tyr Trp lie Met Tyr Trp Asp lie Ala His Gly Ala Tyr Leu 
195 200 . 205 

Ala Gly lie Ala Phe Leu Cys Ala Asp Leu Leu Leu Val Val Val lie 
210 215 220 

Thr Gin lie Cys Met His Phe Asn Tyr lie Ser Met Arg Leu Glu Asp 

225 230 235 240 

His Pro Cys Asn Ser Asn Glu Asp Lys Glu Asn lie Glu Phe Leu lie 
245 250 255 

Gly lie lie Arg Tyr His Asp Lys Cys Leu Lys Leu Cys Glu His Val 
260 265 270 

Asn Asp Leu Tyr Ser Phe Ser Leu Leu Leu Asn Phe Leu Met Ala Ser 
275 280 285 

Met Gin lie Cys Phe He Ala Phe Gin Val Thr Glu Ser Thr Val Glu 
290 295 300 

Val He He He Tyr Cys He Phe Leu Met Thr Ser Met Val Gin Val 
305 310 315 320 

Phe Met Val Cys Tyr Tyr Gly Asp Thr Leu He Ala Ala Ser Leu Lys 
325 330 335 

Val Gly Asp Ala Ala Tyr Asn Gin Lys Trp Phe Gin Cys Ser Lys Ser 
340 345 350 

Tyr Cys Thr Met Leu Lys Leu Leu He Met Arg Ser Gin Lys Pro Ala 
355 360 365 

Ser He Arg Pro Pro Thr Phe Pro Pro He Ser Leu Val Thr Tyr Met 
370 375 380 

Lys Val He Ser Met Ser Tyr Gin Phe Phe Ala Leu Leu Arg Thr Thr 
385 390 395 400 

Tyr Ser Asn Asn 
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<210> 81 
<211> 1179 
<212> DNA 

<213> Drosophila melanogaster 



cag tac age tac gag gat ttc get cga ttg ccc acg 48 
Gin Tyr Ser Tyr Glu Asp Phe Ala Arg Leu Pro Thr 
5 10 15 



<220> 
<221> CDS 
<222> (1) . . (1179) 
<223> DORLU 14.1 

<400> 81 
atg gaa cct gtg 
Met Glu Pro Val 
1 

acg gtg ttc 
Thr Val Phe 



atg ggc tac gac 
Met Gly Tyr Asp 
25 



atg ctg ggc gtt 
Met Leu Gly Val 



atg egg tat gec 
Met Arg Tyr Ala 
75 



ccg aag acc 96 
Pro Lys Thr 
30 

144 



192 



act ctg gtc 240 
Thr Leu Val 
80 



cgc tct cgc agg ata eta tac tgg 
Arg Ser Arg Arg lie Leu Tyr Trp 
35 40 

age cat ggg gtc tgt gta gga gtc 
Ser His Gly Val Cys Val Gly Val 
50 55 



ata tat cgt ttc etc tgt etc gec 
lie Tyr Arg Phe Leu Cys Leu Ala 
45 

atg gta ttt cgt atg gtg gag gca 
Met Val Phe Arg Met Val Glu Ala 
60 



tgg ate 
Trp lie 
20 

agg ata 
Arg lie 



gtc tgt 
Val Cys 



aag acc att gac aat gtt teg ctg ate 
Lys Thr lie Asp Asn Val Ser Leu lie 
65 70 



acc tat ate ate aac teg gat acg aaa ttc gca act gtc tta caa agg 288 
Thr Tyr lie lie Asn Ser Asp Thr Lys Phe Ala Thr Val Leu Gin Arg 
85 90 95 

agt gca att caa agt eta aac tea aaa ctg gee gaa eta tat ccg aag 336 
Ser Ala lie Gin Ser Leu Asn Ser Lys Leu Ala Glu Leu Tyr Pro Lys 
100 105 110 

acc acg ctg gac agg ate tat cac egg gtg aat gat cac tat tgg acc 38 4 
Thr Thr Leu Asp Arg lie Tyr His Arg Val Asn Asp His Tyr Trp Thr 
115 120 125 

aag tea ttt gta tat ttg gtt att ate tac att ggt teg teg att atg 432 
Lys Ser Phe Val Tyr Leu Val lie lie Tyr lie Gly Ser Ser lie Met 
130 135 140 
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gtt gtt att gga ccg att att acg teg att ata get tac ttc acg cac 480 
Val Val lie Gly Pro lie lie Thr Ser lie lie Ala Tyr Phe Thr His 
145 150 155 160 



aac gtt ttc acc tac atg cac tgc tat ccg tac ttt ttg tat gat cct 
Asn Val Phe Thr Tyr Met His Cys Tyr Pro Tyr Phe Leu Tyr Asp Pro 
165 170 175 



gag aag gat ccg gtt tgg ate tac ate age ate tat get ctg gaa tgg 57 6 
Glu Lys Asp Pro Val Trp lie Tyr lie Ser lie Tyr Ala Leu Glu Trp 
180 185 190 



ttg cac age aca cag atg gtc att teg aac att ggc gcg gat -ate tgg 
Leu His Ser Thr Gin Met Val lie Ser Asn lie Gly Ala Asp lie Trp 
195 200 205 



ctg ctg tac ttt cag gtg cag ata aat etc cac ttc agg ggc att ata 
Leu Leu Tyr Phe Gin Val Gin He Asn Leu His Phe Arg Gly He He 
210 215 220 



cga tea ctg gcg gat cac aag ccc agt gtg aag cac gac cag gag gac 720 
Arg Ser Leu Ala Asp His Lys Pro Ser Val Lys His Asp Gin Glu Asp 
225 230 , 235 240 



agg aaa ttc att gcg aaa att gtc gac aag cag gtg cac ctg gtc agt 
Arg Lys Phe He Ala Lys He Val Asp Lys Gin Val His Leu Val Ser 
245 250 255 



ttg caa aac gat ctg aat ggt ate ttt gga aaa teg ctg ctt eta age 816 
Leu Gin Asn Asp Leu Asn Gly He Phe Gly Lys Ser Leu Leu Leu Ser 
260 265 270 



ctg ctg acc acc gca gcg gtt ate tgc acg gtg gcg gtg tac act ctg 8 64 
Leu Leu Thr Thr Ala Ala Val He Cys Thr Val Ala Val Tyr Thr Leu 
275 280 285 



att cag ggt ccc acc ttg gag ggc ttc acc tat gtg ate ttc ate ggg 912 
He Gin Gly Pro Thr Leu Glu Gly Phe Thr Tyr Val lie Phe He Gly 
290 295 300 



act tct gtg atg cag gtc tac ctg gtg tgc tat tac ggt cag caa gtt 960 
Thr Ser Val Met Gin Val Tyr Leu Val Cys Tyr Tyr Gly Gin Gin Val 
305 310 315 320 



etc gac ttg gtg gag cgc gag gtg gee cac gee gtg tac aat cat 
Leu Asp Leu Val Glu Arg Glu Val Ala His Ala Val Tyr Asn His . 

325 330 335 
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ttt 

Phe 



cac gat get tct ata gcg 
His Asp Ala Ser lie Ala 
340 



tac aag agg tac ctg etc ata ate att 1056 
Tyr Lys Arg Tyr Leu Leu lie lie lie 
345 350 



ate agg gcg cag cag ccc gtg gaa ctt aat gec atg ggc tac ctg tec 1104 
He Arg Ala Gin Gin Pro Val Glu Leu Asn Ala Met Gly Tyr Leu Ser 
355 360 365 

att teg ctg gac acc ttt aaa cag ctg atg age gtc tec tac egg gtt 1152 
He Ser Leu Asp Thr Phe Lys Gin Leu Met Ser Val Ser Tyr Arg Val 
370 375 380 



ata acc atg etc atg cag atg att cag 
He Thr Met Leu Met Gin Met He Gin 
385 390 



<210> 82 
<211> 393 
<212> PRT 

<213> Drosophila melanogaster 
<400> 82 

Met Glu Pro Val Gin Tyr Ser Tyr Glu Asp Phe Ala Arg Leu Pro Thr 
15 10 15 

Thr Val Phe Trp He Met Gly Tyr Asp Met Leu Gly Val Pro Lys Thr 



Arg Ser Arg Arg He Leu Tyr Trp He Tyr Arg Phe Leu Cys Leu Ala 

35 40 45 

Ser His Gly Val Cys Val Gly Val Met Val Phe Arg Met Val Glu Ala 

50 55 60 

Lys Thr He Asp Asn Val Ser Leu He Met Arg Tyr Ala Thr Leu Val 

65 70 75 80 

Thr Tyr He lie Asn Ser Asp Thr Lys Phe Ala Thr Val Leu Gin Arg 

85 90 95 

Ser Ala He Gin Ser Leu Asn Ser Lys Leu Ala Glu Leu Tyr Pro Lys 

100 105 110 

Thr Thr Leu Asp Arg He Tyr His Arg Val Asn Asp His Tyr Trp Thr 

115 120 ■ 125 

Lys Ser Phe Val Tyr Leu Val He He Tyr He Gly Ser Ser He Met 
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130 



135 



140 



Val Val He Gly Pro 
145 

Asn Val Phe Thr Tyr 
165 

Glu Lys Asp Pro Val 

180 

Leu His Ser Thr Gin 
195 

Leu Leu Tyr Phe Gin 
210 

Arg Ser Leu Ala Asp 
225 

Arg Lys Phe He Ala 
245 

Leu Gin Asn Asp Leu 
260 

Leu Leu Thr Thr Ala 
275 

He Gin Gly Pro Thr 
290 

Thr Ser Val Met Gin 
305 

Leu Asp Leu Val Glu 
325 

Phe His Asp Ala Ser 
340 

He Arg Ala Gin Gin 
355 

He Ser Leu Asp Thr 
370 

He Thr Met Leu Met 



He He Thr Ser He He 
150 155 

Met His Cys Tyr Pro Tyr 
170 

Trp He Tyr He Ser He 
185 

Met Val He Ser Asn He 
200 

Val Gin He Asn Leu His 
215 

His Lys Pro Ser Val Lys 
230 235 

Lys He Val Asp Lys Gin 
250 

Asn Gly He Phe Gly Lys 
265 

Ala Val He Cys Thr Val 
280 

Leu Glu Gly Phe Thr Tyr 
295 

Val Tyr Leu Val Cys Tyr 
310 315 

Arg Glu Val Ala His Ala 
330 

He Ala Tyr Lys Arg Tyr 
345 

Pro Val Glu Leu Asn Ala 
360 

Phe Lys Gin Leu Met Ser 
375 

Gin Met He Gin 



Ala Tyr Phe Thr His 

160 

Phe Leu Tyr Asp Pro 
175 

Tyr Ala Leu Glu Trp 
190 

Gly Ala Asp He Trp 
205 

Phe Arg Gly He He 
220 

His Asp Gin Glu Asp 
240 

Val His Leu Val Ser 
255 

Ser Leu Leu Leu Ser 
270 

Ala Val Tyr Thr Leu 
285 

Val He Phe He Gly 
300 

Tyr Gly Gin Gin Val 
320 

Val Tyr Asn His Asp 
335 

Leu Leu He He He 
350 

Met Gly Tyr Leu Ser 
365 

Val Ser Tyr Arg Val 
380 
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385 



390 



<210> 83 
<211> 1134 
<212> DNA 

<213> Drosophila melanogaster 



<220> 
<221> CDS 
<222> (1) . . (1134) 
<223> DORLU 15.1 



<400> 83 

atg gac gcc age tac ttt gec gtc cag aga aga get ctg gaa ata gtt 48 
Met Asp Ala Ser Tyr Phe Ala Val Gin Arg Arg Ala Leu Glu lie Val 
15 10 15 



gga ttc gat ccc agt act ccg caa ctg agt ctg aaa cat ccc ate tgg 
Gly Phe Asp Pro Ser Thr Pro Gin Leu Ser Leu Lys His Pro lie Trp 
20 25 30 



gcc ggg att etc ate ctg tec ttg ate tct cac aac tgg ccc atg gta 
Ala Gly lie Leu He Leu Ser Leu He Ser His Asn Trp Pro Met Val 



gtc tat gcc ctg cag gat etc tec gac ttg ace cgt ctg acg gac aac 
Val Tyr Ala Leu Gin Asp Leu Ser Asp Leu Thr Arg Leu Thr Asp Asn 
50 55 60 



ttt gcg gtg ttt atg caa gga tea cag age acc ttc aag ttc ctg gtc 240 
Phe Ala Val Phe Met Gin Gly Ser Gin Ser Thr Phe Lys Phe Leu Val 
65 70 75 80 



atg atg gcg aaa cga agg cgc att gga teg ttg att cac cgt ttg cat 288 
Met Met Ala Lys Arg Arg Arg He Gly Ser Leu He His Arg Leu His 
85 90 95 



aag eta aac cag gcg gcc agt gcc acg ccc aat cac ctg gag aag ate 
Lys Leu Asn Gin Ala Ala Ser Ala Thr Pro Asn His Leu Glu Lys He 
100 105 110 



gag agg gaa aac caa ctg gat agg tat gtc gcc agg tec ttt aga aat 
Glu Arg Glu Asn Gin Leu Asp Arg Tyr Val Ala Arg Ser Phe Arg Asn 
115 120 125 



gcc gcc tac gga gtg att tgt gcc teg gcc ata gcg ccc atg ttg ctt 432 
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Ala Ala Tyr Gly Val lie Cys Ala Ser Ala lie Ala Pro Met Leu Leu 
130 135 140 



ggc ctg tgg gga tat gtg gag acg ggt gta ttt acc ccg acc aca ccc 
Gly Leu Trp Gly Tyr Val Glu Thr Gly Val Phe Thr Pro Thr Thr Pro 
145 150 155 . 160 

atg gag ttc aac ttc tgg ctg gac gag cga aag cct cac ttt tat tgg 
Met Glu Phe Asn Phe Trp Leu Asp Glu Arg Lys Pro His Phe Tyr Trp 
165 170 175 

ccc ate tac gtt tgg ggc gta ctg ggc gtg gca get gec gec tgg ttg 
Pro He Tyr Val Trp Gly Val Leu Gly Val Ala Ala Ala Ala Trp Leu 
180 185 190 

gec att gca acg gac acc ctg ttc tec tgg ctg act cac aat gtg gtg 
Ala He Ala Thr Asp Thr Leu Phe Ser Trp Leu Thr His Asn Val Val 
195 200 205 

att cag ttc caa eta ctg gag ctt gtt etc gaa gag aag gat ctg aat 
He Gin Phe Gin Leu Leu Glu Leu Val Leu Glu Glu Lys Asp Leu Asn 
210 215 220 

ggc gga gac tct cgc ctg acc ggg ttt gtt agt cgt cat cgt ata get 
Gly Gly Asp Ser Arg Leu Thr Gly Phe Val Ser Arg His Arg lie Ala 
225 230 235 240 

ctg gat ttg gec aag gaa eta agt teg att ttc ggg gag ate gtc ttt 
Leu Asp Leu Ala Lys Glu Leu Ser Ser He Phe Gly Glu He Val Phe 
245 250 255 

gtg aaa tac atg etc agt tac ctg caa etc tgc atg ttg gec ttt cgc 
Val Lys Tyr Met Leu Ser Tyr Leu Gin Leu Cys Met Leu Ala Phe Arg 
260 265 270 

ttc age cgc agt ggc tgg agt gee cag gtg cca ttt aga gee acc ttc 
Phe Ser Arg Ser Gly Trp Ser Ala Gin Val Pro Phe Arg Ala Thr Phe 
275 280 285 

eta gtg gee ate ate ate caa ctg agt teg tat tgc tat gga ggc gag 
Leu Val Ala He He He Gin Leu Ser Ser Tyr Cys Tyr Gly Gly Glu 
290 295 300 

tat ata aag cag caa agt ttg gee ate gca caa gee gtt tat ggt caa 
Tyr He Lys Gin Gin Ser Leu Ala He Ala Gin Ala Val Tyr Gly Gin 
305 310 315 320 

ate aat tgg cca gaa atg acg cca aag aaa aga aga etc tgg caa atg 
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lie Asn Trp Pro Glu Met Thr Pro Lys Lys Arg Arg Leu Trp Gin Met 
325 330 335 



gtg ate atg agg gcg cag cga ccg get aag att ttt gga ttc atg ttc 1056 
Val lie Met Arg Ala Gin Arg Pro Ala Lys lie Phe Gly Phe Met Phe 
340 345 350 

gtt gtg gac ttg cca ctg ctg ctt tgg gtc ate aga act. gcg ggc tea 1104 
Val Val Asp Leu Pro Leu Leu Leu Trp Val lie Arg Thr Ala Gly Ser 
355 360 365 



ttt ctg gee atg ctt agg act ttc gag cgt 
Phe Leu Ala Met Leu Arg Thr Phe Glu Arg 
370 375 



<210> 84 
<211> 378 
<212> PRT 

<213> Drosophila melanogaster 
<400> 84 

Met Asp Ala Ser Tyr Phe Ala Val Gin Arg Arg Ala Leu Glu lie Val 



Gly Phe Asp Pro Ser Thr Pro Gin Leu Ser Leu Lys His Pro lie Trp 
20 25 30 

Ala Gly lie Leu lie Leu Ser Leu lie Ser His Asn Trp Pro Met Val 



Val Tyr Ala Leu Gin Asp Leu Ser Asp Leu Thr Arg Leu Thr Asp Asn 



Phe Ala Val Phe Met Gin Gly Ser Gin Ser Thr Phe Lys Phe Leu Val 
65 70 75 80 

Met Met Ala Lys Arg Arg Arg lie Gly Ser Leu lie His Arg Leu His 
85 90 95 

Lys Leu Asn Gin Ala Ala Ser Ala Thr Pro Asn His Leu Glu Lys lie 
100 105 110 

Glu Arg Glu Asn Gin Leu Asp Arg Tyr Val Ala Arg Ser Phe Arg Asn 
115 120 125 

Ala Ala Tyr Gly Val lie Cys Ala Ser Ala lie Ala Pro Met Leu Leu 
130 135 140 
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Gly Leu Trp Gly Tyr Val Glu Thr Gly Val Phe Thr Pro Thr Thr Pro 
145 150 155 160 



Met Glu Phe Asn Phe Trp Leu Asp Glu Arg Lys Pro His Phe Tyr Trp 
165 170 175 

Pro lie Tyr Val Trp Gly Val Leu Gly Val Ala Ala Ala Ala Trp Leu 
180 185 190 

Ala lie Ala Thr Asp Thr Leu Phe Ser Trp Leu Thr His Asn Val Val 
195 200 205 

lie Gin Phe Gin Leu Leu Glu Leu Val Leu Glu Glu Lys Asp Leu Asn 
210 215 220 

Gly Gly Asp Ser Arg Leu Thr Gly Phe Val Ser Arg His Arg lie Ala 
225 230 235 240 

Leu Asp Leu Ala Lys Glu Leu Ser Ser lie Phe Gly Glu lie Val Phe 
245 250 255 

Val Lys Tyr Met Leu Ser Tyr Leu Gin Leu Cys Met Leu Ala Phe Arg 
260 265 270 

Phe Ser Arg Ser Gly Trp Ser Ala Gin Val Pro Phe Arg Ala Thr Phe 
275 280 285 

Leu Val Ala lie lie lie Gin Leu Ser Ser Tyr Cys Tyr Gly Gly Glu 
290 295 300 

Tyr lie Lys Gin Gin Ser Leu Ala lie Ala Gin Ala Val Tyr Gly Gin 
305 310 315 320 

lie Asn Trp Pro Glu Met Thr Pro Lys Lys Arg Arg Leu Trp Gin Met 
325 330 335 

Val lie Met Arg Ala Gin Arg Pro Ala Lys He Phe Gly Phe Met Phe 
340 345 350 

Val Val Asp Leu Pro Leu Leu Leu Trp Val lie Arg Thr Ala Gly Ser 
355 360 365 

Phe Leu Ala Met Leu Arg Thr Phe Glu Arg 
370 375 
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<210> 85 
<211> 1065 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1065) 

<223> DORLU 16.1 

<400> 85 

atg gaa aaa eta cgt tec tat gag gat ttc ate ttc atg gec aac atg 48 
Met Glu Lys Leu Arg Ser Tyr Glu Asp Phe lie Phe Met Ala Asn Met 
15 10 15 

atg ttc aag acc ctt ggc tac gat eta ttc cat aca ccc aaa ccc tgg 96 
Met Phe Lys Thr Leu Gly Tyr Asp Leu Phe His Thr Pro Lys Pro Trp 
20 25 30 

tgg cgc tat ctg ctt gtg cga gga tac ttc gtt ttg tgc acg ate age 144 
Trp Arg Tyr Leu Leu Val Arg Gly Tyr Phe Val Leu Cys Thr lie Ser 
35 40 45 

aac ttt tac gag get tec atg gtg acg aca agg ata att gag tgg gaa 192 
Asn Phe Tyr Glu Ala Ser Met Val Thr Thr Arg lie lie Glu Trp Glu 
50 55 60 

tec ttg gee gga agt ccc tec aaa ata atg cga cag ggt ctg cac ttc 240 
Ser Leu Ala Gly Ser Pro Ser Lys lie Met Arg Gin Gly Leu His Phe 
65 70 75 80 

ttt tac atg ttg agt age caa ttg aaa ttt ate aca ttc atg ata aat 288 
Phe Tyr Met Leu Ser Ser Gin Leu Lys Phe lie Thr Phe Met lie Asn 
85 90 95 

cgc aaa cgc eta ctg cag ctg age cat cgt ttg aaa gag ttg tat cct 336 
Arg Lys Arg Leu Leu Gin Leu Ser His Arg Leu Lys Glu Leu Tyr Pro 
100 105 110 

cat aaa gag caa aat caa agg aag tac gag gtg aat aaa tac tac eta 384 
His Lys Glu Gin Asn Gin Arg Lys Tyr Glu Val Asn Lys Tyr Tyr Leu 
115 120 125 

tec tgt tec acg cgc aat gtt ttg tac gtg tac tac ttt gta atg gtc 432 
Ser Cys Ser Thr Arg Asn Val Leu Tyr Val Tyr Tyr Phe Val Met Val 
130 135 140 

gtc atg gca ctg gaa ccc etc gtt cag teg tgc att ate cag ttc ata 480 
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Val Met Ala Leu Glu Pro Leu Val Gin Ser Cys lie lie Gin Phe lie 
145 150 155 160 

gtg aat gtg age ctg ggc aca gat ctg tgg atg atg tgc gtc tea age 528 
Val Asn Val Ser Leu Gly Thr Asp Leu Trp Met Met Cys Val Ser Ser 
165 170 175 

caa ata teg atg cac ttg ggc tat ctg gec aat atg ttg gec tec att 576 
Gin lie Ser Met His Leu Gly Tyr Leu Ala Asn Met Leu Ala Ser lie 
180 185 190 

cga cca agt cca gaa acg gaa caa caa gac tgt gac ttc ttg gec age 624 
Arg Pro Ser Pro Glu Thr Glu Gin Gin Asp Cys Asp Phe Leu Ala Ser 
195 200 205 

att ata aag aga cat caa eta atg ate agg ctt caa aag gac gtg aac 672 
lie lie Lys Arg His Gin Leu Met lie Arg Leu Gin Lys Asp Val Asn 
210 215 220 

tat gtt ttt gga etc tta ttg gca tct aat ctg ttt ace aca tec tgt 720 
Tyr Val Phe Gly Leu Leu Leu Ala Ser Asn Leu Phe Thr Thr Ser Cys 
225 .230 235 240 

tta ctt tgc tgc atg gcg tac tat ace gtc gtc gaa ggt ttc aat tgg 768 
Leu Leu Cys Cys Met Ala Tyr Tyr Thr Val Val Glu Gly Phe Asn Trp 
245 250 255 

gag ggc att tec tat atg atg etc ttt get agt gta get gee cag ttc 816 
Glu Gly lie Ser Tyr Met Met Leu Phe Ala Ser Val Ala Ala Gin Phe 
260 265 270 

tac gtt gtc age tea cac gga caa atg tta ata gat ttg agt aca aat 8 64 
Tyr Val Val Ser Ser His Gly Gin Met Leu lie Asp Leu Ser Thr Asn 
275 280 285 

tta gec aag get gec ttt gaa age aag tgg tat gaa gga tct ttg cga 912 
Leu Ala Lys Ala Ala Phe Glu Ser Lys Trp Tyr Glu Gly Ser Leu Arg 
290 295 300 

taq aaa aag gag ata etc att eta atg gca cag get caa cga cct ttg 960 
Tyr Lys Lys Glu lie Leu lie Leu Met Ala Gin Ala Gin Arg Pro Leu 
305 310 315 320 

gag att tea gec agg gga gta att ate ata tec etc gac ace ttt aaa 1008 
Glu lie Ser Ala Arg Gly Val lie lie lie Ser Leu Asp Thr Phe Lys 
325 330 335 

ata ttg atg ace ate aca tac aga ttt ttc gcg gtt ata cga caa act 1056 
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He Leu Met Thr He Thr Tyr Arg Phe Phe Ala Val He Arg Gin Thr 
340 345 350 



gta gaa aag 
Val Glu Lys 
355 



<210> 86 
<211> 355 
<212> PRT 

<213> Drosophila melanogaster 
<400> 86 

Met Glu Lys Leu Arg Ser Tyr Glu Asp Phe lie Phe Met Ala Asn Met 
15 10 15 

Met Phe Lys Thr Leu Gly Tyr Asp Leu Phe His Thr Pro Lys Pro Trp 
20 25 30 

Trp Arg Tyr Leu Leu Val Arg Gly Tyr Phe Val Leu Cys Thr He Ser 
35 40 45 

Asn Phe Tyr Glu Ala Ser Met Val Thr Thr Arg He He Glu Trp Glu 
50 55 60 

Ser Leu Ala Gly Ser Pro Ser Lys He Met Arg Gin Gly Leu His Phe 
65 70 75 80 

Phe Tyr Met Leu Ser Ser Gin Leu Lys Phe He Thr Phe Met He Asn 
85 90 95 

Arg Lys Arg Leu Leu Gin Leu Ser His Arg Leu Lys Glu Leu Tyr Pro 
100 105 110 

His Lys Glu Gin Asn Gin Arg Lys Tyr Glu Val Asn Lys Tyr Tyr Leu 
115 120 125 

Ser Cys Ser Thr Arg Asn Val Leu Tyr Val Tyr Tyr Phe Val Met Val 
130 135 140 

Val Met Ala Leu Glu Pro Leu Val Gin Ser Cys He He Gin Phe lie 
145 150 155 160 

Val Asn Val Ser Leu Gly Thr Asp Leu Trp Met Met Cys Val Ser Ser 
165 170 175 

Gin He Ser Met His Leu Gly Tyr Leu Ala Asn Met Leu Ala Ser He 
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180 



185 



190 



Arg Pro Ser Pro Glu Thr Glu Gin Gin Asp Cys Asp Phe Leu Ala Ser 
195 200 205 

lie lie Lys Arg His Gin Leu Met lie Arg Leu Gin Lys Asp Val Asn 
210 215 220 

Tyr Val Phe Gly Leu Leu Leu Ala Ser Asn Leu Phe Thr Thr Ser Cys 
225 230 235 240 

Leu Leu Cys Cys Met Ala Tyr Tyr Thr Val Val Glu Gly Phe Asn Trp 
245 250 255 

Glu Gly lie Ser Tyr Met Met Leu Phe Ala Ser Val Ala Ala Gin Phe 
260 265 270 

Tyr Val Val Ser Ser His Gly Gin Met Leu lie Asp Leu Ser Thr Asn 
275 280 285 

Leu Ala Lys Ala Ala Phe Glu Ser Lys Trp Tyr Glu Gly Ser Leu Arg 
290 295 300 

Tyr Lys Lys Glu lie Leu lie Leu Met Ala Gin Ala Gin Arg Pro Leu 
305 310 315 320 

Glu lie Ser Ala Arg Gly Val lie lie lie Ser Leu Asp Thr Phe Lys 
325 330 335 

He Leu Met Thr He Thr Tyr Arg Phe Phe Ala Val He Arg Gin Thr 
340 345 350 

Val Glu Lys 
355 



<210> 87 
<211> 1272 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1272) 
<223> DORLU 22.1 

<400> 87 
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atg ctg acg gac aag ttc etc cga ctg cag tec get tta ttt cgc ctt 
Met Leu Thr Asp Lys Phe Leu Arg Leu Gin Ser Ala Leu Phe Arg Leu 



etc gga etc gaa ttg ttg cac gag cag gat gtt ggc cat cga tat cct 
Leu Gly Leu Glu Leu Leu His Glu Gin Asp Val Gly His Arg Tyr Pro 
20 25 30 

tgg cgc age ate tgc tgc att etc teg gtg gec agt ttc atg ccc ctg 
Trp Arg Ser lie Cys Cys lie Leu Ser Val Ala Ser Phe Met Pro Leu 



acc att gcg ttt ggc ctg caa aac gtc caa aat gtg gag caa tta acc 
Thr lie Ala Phe Gly Leu Gin Asn Val Gin Asn Val Glu Gin Leu Thr 
50 55 60 

gac tea etc tgc teg gtt etc gtg gat ttg ctg gec ctg tgc aaa ate 
Asp Ser Leu Cys Ser Val Leu Val Asp Leu Leu Ala Leu Cys Lys lie 



ggg ctt ttc ctt tgg ctt tac aag gac ttc aag ttc eta ata ggg cag 
Gly Leu Phe Leu Trp Leu Tyr Lys Asp Phe Lys Phe Leu lie Gly Gin 
85 90 95 

ttc tat tgt gtt ttg caa acg gaa acc cac acc get gtc get gaa atg 
Phe Tyr Cys Val Leu Gin Thr Glu Thr His Thr Ala Val Ala Glu Met 
100 105 110 

ata gtg acc agg gaa agt cgt egg gat cag ttc ate agt get atg tat 
lie Val Thr Arg Glu Ser Arg Arg Asp Gin Phe lie Ser Ala Met Tyr 
115 120 125 

gec tac tgt ttc att acg get ggc ctt teg gec tgc ctg atg tec cct 
Ala Tyr Cys Phe lie Thr Ala Gly Leu Ser Ala Cys Leu Met Ser Pro 
130 135 140 

eta tec atg ctg att age tac cac gaa cag gtg aat tgc age cga aat 
Leu Ser Met Leu lie Ser Tyr His Glu Gin Val Asn Cys Ser Arg Asn 
145 150 155 160 

ttc cat ttc cca gtg tgt aag aaa aag tac tgc tta ata tec aga ata 
Phe His Phe Pro Val Cys Lys Lys Lys Tyr Cys Leu lie Ser Arg lie 
165 170 175 

tta aga tac agt ttc tgc aga tat ccc tgg gac aat atg aag ctg tec 
Leu Arg Tyr Ser Phe Cys Arg Tyr Pro Trp Asp Asn Met Lys Leu Ser 
180 185 190 



175 



aac tac ate att tec tat ttc tgg aat gtg tgt get gca ttg ggc gtg 
Asn Tyr lie lie Ser Tyr Phe Trp Asn Val Cys Ala Ala Leu Gly Val 
195 200 205 

gca ctg ccc acc gtt tgt gtg gac aca ctg ttc tgt tct ctg age cat 
Ala Leu Pro Thr Val Cys Val Asp Thr Leu Phe Cys Ser Leu Ser His 
210 215 220 

aat etc tgt gec eta ttc cag att gec agg cac aaa atg atg cac ttt 
Asn Leu Cys Ala Leu Phe Gin lie Ala Arg His Lys Met Met His Phe 
225 230 235 240 

gag ggc aga aat acc aaa gag act cat gag aac tta aag cac gtg ttt 
Glu Gly Arg Asn Thr Lys Glu Thr His Glu Asn Leu Lys His Val Phe 
245 250 255 

caa eta tat gcg ttg tgt ttg aac ctg ggc cat ttc tta aac gaa tat 
Gin Leu Tyr Ala Leu Cys Leu Asn Leu Gly His Phe Leu Asn Glu Tyr 
260 265 270 

ttc aga ccg etc ate tgc cag ttt gtg gca gec tea ctg cac ttg tgt 
Phe Arg Pro Leu lie Cys Gin Phe Val Ala Ala Ser Leu His Leu Cys 
275 280 285 

gtc ctg tgc tac caa ctg tct gee aat ate ctg cag cca gcg tta etc 
Val Leu Cys Tyr Gin Leu Ser Ala Asn lie Leu Gin Pro Ala Leu Leu 
290 295 300 

ttc tat gee gca ttt acg gca gca gtt gtt ggc cag gtg tct ata tac 
Phe Tyr Ala Ala Phe Thr Ala Ala Val Val Gly Gin Val Ser lie Tyr 
305 310 315 320 

tgc ttc tgc gga teg age ate cat teg gag tgt cag eta ttt ggc cag 
Cys Phe Cys Gly Ser Ser lie His Ser Glu Cys Gin Leu Phe Gly Gin 
325 330 335 

gee ate tac gag tec age tgg ccc cat ctg ctg cag gaa aac ctg cag 
Ala lie Tyr Glu Ser Ser Trp Pro His Leu Leu Gin Glu Asn Leu Gin 
340 345 350 

ctt gta age tec tta aaa att gec atg atg cga teg agt ttg gga tgt 
Leu Val Ser Ser Leu Lys lie Ala Met Met Arg Ser Ser Leu Gly Cys 
355 360 365 

ccc ate gat ggt tac ttc ttc gag gee aat egg gag acg etc ate acg 
Pro lie Asp Gly Tyr Phe Phe Glu Ala Asn Arg Glu Thr Leu lie Thr 
370 375 380 
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ate cct ggc eta get 
lie Pro Gly Leu Ala 
385 



ttc egg get ttc 
Phe Arg Ala Phe 
390 



att att cag tgg ttc agt cgt 1200 
He He Gin Trp Phe Ser Arg 
395 400 



teg ggt ttg ttt aac tec gga aat att tac aat tat get tta age egg 1248 
Ser Gly Leu Phe Asn Ser Gly Asn He Tyr Asn Tyr Ala Leu Ser Arg 
405 410 415 

tgt tgt tac age cag ttg get aat 1272 
Cys Cys Tyr Ser Gin Leu Ala Asn 
420 



<210> 88 
<211> 424 
<212> PRT 

<213> Drosophila melanogaster 
<400> 88 

Met Leu Thr Asp Lys Phe Leu Arg Leu Gin Ser Ala Leu Phe Arg Leu 
15 10 15 

Leu Gly Leu Glu Leu Leu His Glu Gin Asp Val Gly His Arg Tyr Pro 
20 -25 30 

Trp Arg Ser lie Cys Cys He Leu Ser Val Ala Ser Phe Met Pro Leu 
35 40 45 

Thr He Ala Phe Gly Leu Gin Asn Val Gin Asn Val Glu Gin Leu Thr 
50 55 60 

Asp Ser Leu Cys Ser Val Leu Val Asp Leu Leu Ala Leu Cys Lys He 
65 70 75 80 

Gly Leu Phe Leu Trp Leu Tyr Lys Asp Phe Lys Phe Leu He Gly Gin 

85 90 95 

Phe Tyr Cys Val Leu Gin Thr Glu Thr His Thr Ala Val Ala Glu Met 
100 105 110 

He Val Thr Arg Glu Ser Arg Arg Asp Gin Phe He Ser Ala Met Tyr 
115 120 125 

Ala Tyr Cys Phe He Thr Ala Gly Leu Ser Ala Cys Leu Met Ser Pro 
130 135 140 

Leu Ser Met Leu He Ser Tyr His Glu Gin Val Asn Cys Ser Arg Asn 
145 150 155 160 
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Phe His Phe Pro Val Cys Lys Lys Lys Tyr Cys Leu lie Ser Arg lie 
165 170 175 



Leu Arg Tyr Ser Phe Cys Arg Tyr Pro Trp Asp Asn Met Lys Leu Ser 
180 185 190 

Asn Tyr lie lie Ser Tyr Phe Trp Asn Val Cys Ala Ala Leu Gly Val 
195 200 205 

Ala Leu Pro Thr Val Cys Val Asp Thr Leu Phe Cys Ser Leu Ser His 
210 215 220 

Asn Leu Cys Ala Leu Phe Gin lie Ala Arg His Lys Met Met His Phe 
225 230 235 240 

Glu Gly Arg Asn Thr Lys Glu Thr His Glu Asn Leu Lys His Val Phe 
245 250 255 

Gin Leu Tyr Ala Leu Cys Leu Asn Leu Gly His Phe Leu Asn Glu Tyr 
260 265 270 

Phe Arg Pro Leu lie Cys Gin Phe Val Ala Ala Ser Leu His Leu Cys 
275 280 285 

Val Leu Cys Tyr Gin Leu Ser Ala Asn lie Leu Gin Pro Ala Leu Leu 
290 295 300 

Phe Tyr Ala Ala Phe Thr Ala Ala Val Val Gly Gin Val Ser lie Tyr 
305 310 315 320 

Cys Phe Cys Gly Ser Ser He His Ser Glu Cys Gin Leu Phe Gly Gin 
325 330 335 

Ala lie Tyr Glu Ser Ser Trp Pro His Leu Leu Gin Glu Asn Leu Gin 
340 345 350 

Leu Val Ser Ser Leu Lys He Ala Met Met Arg Ser Ser Leu Gly Cys 
355 360 365 

Pro He Asp Gly Tyr Phe Phe Glu Ala Asn Arg Glu Thr Leu He Thr 
370 375 380 

He Pro Gly Leu Ala Phe Arg Ala Phe He He Gin Trp Phe Ser Arg 
385 390 395 400 

Ser Gly Leu Phe Asn Ser Gly Asn He Tyr Asn Tyr Ala Leu Ser Arg 
405 410 415 

178 



em 



Cys Cys Tyr Ser Gin Leu Ala Asn 
420 



<210> 89 
<211> 1176 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1176) 

<223> DORLU 24.1 



<400> 89 

atg tea aag eta ate gag gtg ttt ctg ggt aat ctg tgg acg cag cgt 48 
Met Ser Lys Leu lie Glu Val Phe Leu Gly Asn Leu Trp Thr Gin Arg 
15 10 15 



ttt acc ttc gec cga atg ggt ttg gat ttg cag ccc gat aaa aag ggc 96 

Phe Thr Phe Ala Arg Met Gly Leu Asp Leu Gin Pro Asp Lys Lys Gly 

20 25 30 

aat gtt ttg cga tct ccg ctt ctt tat tgt att atg tgt ctg aca aca 144 

Asn Val Leu Arg Ser Pro Leu Leu Tyr Cys lie Met Cys Leu Thr Thr 

35 40 45 



age ttt gag etc tgc acc gtg tgc gee ttt atg gtc caa aat cgc aac 192 
Ser Phe Glu Leu Cys Thr Val Cys Ala Phe Met Val Gin Asn Arg Asn 
50 55 60 



caa ate gtg ctt tgt tec gag gee ctg atg cac gga eta cag atg gtc 240 
Gin lie Val Leu Cys Ser Glu Ala Leu Met His Gly Leu Gin Met Val 
65 70 75 80 



tec teg eta ctg aag atg get ata ttc ttg gec aaa tct cac gac ctg 288 
Ser Ser Leu Leu Lys Met Ala lie Phe Leu Ala Lys Ser His Asp Leu 
85 90 95 



gtg gac eta att caa cag att cag teg cct ttt aca gag gag gat ctt 336 
Val Asp Leu He Gin Gin He Gin Ser Pro Phe Thr Glu Glu Asp Leu 
100 105 110 



gta ggt aca gag tgg aga tec caa aat caa agg gga caa eta atg get 384 
Val Gly Thr Glu Trp Arg Ser Gin Asn Gin Arg Gly Gin Leu Met Ala 
115 120 125 
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gcc att tac ttt atg atg tgt gcc ggt acg agt gtg tea ttt ctg ttg 432 
Ala He Tyr Phe Met Met Cys Ala Gly Thr Ser Val Ser Phe Leu Leu 
130 135 140 



atg cca gtg get ttg acc atg ctt aag tac cat tec act ggg gaa ttc 
Met Pro Val Ala Leu Thr Met Leu Lys Tyr His Ser Thr Gly Glu Phe 
145 150 155 160 



gcg cct gtc age teg ttc egg gtt ctg ctt cca tac gat gtg aca caa 
Ala Pro Val Ser Ser Phe Arg Val Leu Leu Pro Tyr Asp Val Thr Gin 
165 170 175 



ccg cat gtt tat gcc atg gac tgc tgc ttg atg gta ttt gtg tta agt 
Pro His Val Tyr Ala Met Asp Cys Cys Leu Met Val Phe Val Leu Ser 
180 185 190 



ttt ttt tgc tgc tec acc acc gga gtg gat acc tta tat gga tgg tgt 624 
Phe Phe Cys Cys Ser Thr Thr Gly Val Asp Thr Leu Tyr Gly Trp Cys 
195 200 205 



get tta ggc gtg agt tta caa tac cgt cgc etc ggt caa caa ctt aaa 
Ala Leu Gly Val Ser Leu Gin Tyr Arg Arg Leu Gly Gin Gin Leu Lys 
210 215 220 



agg ata ccc tec tgt ttc aat cca tct egg tct gac ttt gga tta agt 720 
Arg He Pro Ser Cys Phe Asn Pro Ser Arg Ser Asp Phe Gly Leu Ser 
225 230 235 240 



ggg att ttt gtg gag cat get cgt ctg ctt aaa ata gtc caa cat ttt 768 
Gly He Phe Val Glu His Ala Arg Leu Leu Lys He Val Gin His Phe 
245 250 255 



aat tat agt ttt atg gag ate gca ttt gtg gag gtt gtt ata ate tgt 816 
Asn Tyr Ser Phe Met Glu He Ala Phe Val Glu Val Val He He Cys 
260 265 270 



gga etc tat tgc tea gta att tgt cag tat ata atg cca cac acc aac 864 
Gly Leu Tyr Cys Ser Val He Cys Gin Tyr He Met Pro His Thr Asn 
275 280 285 



caa aac ttc gcc ttt ctg ggt ttc ttt tea ttg gta gtt acc aca cag 912 
Gin Asn Phe Ala Phe Leu Gly Phe Phe Ser Leu Val Val Thr Thr Gin 
290 295 300 



ctg tgc ate tat ctt ttc ggt gcc gaa cag gtc cgt ttg gag get gag 960 
Leu Cys He Tyr Leu Phe Gly Ala Glu Gin Val Arg Leu Glu Ala Glu 
305 310 315 320 



180 



cga ttt tec egg ctg eta tac gaa gta att cct tgg caa aac ctt cct 1008 
Arg Phe Ser Arg Leu Leu Tyr Glu Val He Pro Trp Gin Asn Leu Pro 
325 330 335 

cct aaa cac egg aaa ctt ttc ctt ttt cca att gag cgc gee caa cga 1056 
Pro Lys His Arg Lys Leu Phe Leu Phe Pro He Glu Arg Ala Gin Arg 
340 345 350 

gaa act gtt etc ggt get tat ttc ttc gaa eta ggc aga cct ctt ctt 1104 
Glu Thr Val Leu Gly Ala Tyr Phe Phe Glu Leu Gly Arg Pro Leu Leu 
355 360 365 

gtt tgg ata ttt cgc aca gca ggc tct ttt aca act ttg atg aac get 1152 
Val Trp He Phe Arg Thr Ala Gly Ser Phe Thr Thr Leu Met Asn Ala 
370 375 380 

etc tac gca aaa tac gaa acg cat 1176 
Leu Tyr Ala Lys Tyr Glu Thr His 
385 390 



<210> 90 
<211> 392 
<212> PRT 

<213> Drosophila melanogaster 
<400> 90 

Met Ser Lys Leu He Glu Val Phe Leu Gly Asn Leu Trp Thr Gin Arg 



Phe Thr Phe Ala Arg Met Gly Leu Asp Leu Gin Pro Asp Lys Lys Gly 



Asn Val Leu Arg Ser Pro Leu Leu Tyr Cys He Met Cys Leu Thr Thr 
35 40 45 

Ser Phe Glu Leu Cys Thr Val Cys Ala Phe Met Val Gin Asn Arg Asn 



Gin He Val Leu Cys Ser Glu Ala Leu Met His Gly Leu Gin Met Val 

65 70 75 80 

Ser Ser Leu Leu Lys Met Ala He Phe Leu Ala Lys Ser His Asp Leu 

85 90 95 

Val Asp Leu He Gin Gin He Gin Ser Pro Phe Thr Glu Glu Asp Leu 

100 105 110 
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Val Gly Thr Glu Trp Arg Ser Gin Asn Gin Arg Gly Gin Leu Met Ala 
115 120 125 



Ala lie Tyr Phe Met Met Cys Ala Gly Thr Ser Val Ser Phe Leu Leu 
130 135 140 

Met Pro Val Ala Leu Thr Met Leu Lys Tyr His Ser Thr Gly Glu Phe 
145 150 155 160 

Ala Pro Val Ser Ser Phe Arg Val Leu Leu Pro Tyr Asp Val Thr Gin 
165 170 175 

Pro His Val Tyr Ala Met Asp Cys Cys Leu Met Val Phe Val Leu Ser 
180 185 190 

Phe Phe Cys Cys Ser Thr Thr Gly Val Asp Thr Leu Tyr Gly Trp Cys 
195 200 205 

Ala Leu Gly Val Ser Leu Gin Tyr Arg Arg Leu Gly Gin Gin Leu Lys 
210 215 220 

Arg He Pro Ser Cys Phe Asn Pro Ser Arg Ser Asp Phe Gly Leu Ser 
225 230 235 240 

Gly He Phe Val Glu His Ala Arg Leu Leu Lys He Val Gin His Phe 
245 250 255 

Asn Tyr Ser Phe Met Glu He Ala Phe Val Glu Val Val He He Cys 
260 265 270 

Gly Leu Tyr Cys Ser Val He Cys Gin Tyr He Met Pro His Thr Asn 
275 280 285 

Gin Asn Phe Ala Phe Leu Gly Phe Phe Ser Leu Val Val Thr Thr Gin 
290 295 300 

Leu Cys He Tyr Leu Phe Gly Ala Glu Gin Val Arg Leu Glu Ala Glu 
305 310 315 320 

Arg Phe Ser Arg Leu Leu Tyr Glu Val He Pro Trp Gin Asn Leu Pro 
325 330 335 

Pro Lys His Arg Lys Leu Phe Leu Phe Pro He Glu Arg Ala Gin Arg 
340 345 350 

Glu Thr Val Leu Gly Ala Tyr Phe Phe Glu Leu Gly Arg Pro Leu Leu 
355 360 365 



Val Trp He Phe Arg Thr Ala Gly Ser Phe Thr Thr Leu Met Asn Al; 
370 375 380 

Leu Tyr Ala Lys Tyr Glu Thr His 
385 390 



<210> 91 
<211> 1359 
<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) . . (1359) 
<223> DORLU 25.1 

<400> 91 

atg aag age aca ttc aag gaa gaa agg att aag gac gac tec aag cgt 4 8 
Met Lys Ser Thr Phe Lys Glu Glu Arg lie Lys Asp Asp Ser Lys Arg 
15 10 15 

cgc gac ctg ttt gta ttc gtg agg caa acc atg tgt ata gcg gec atg 96 
Arg Asp Leu Phe Val Phe Val Arg Gin Thr Met Cys He Ala Ala Met 
20 25 30 

tat ccc ttc ggt tac tac gtg aat gga tct gga gtc ctg gec gtt ctg 144 
Tyr Pro Phe Gly Tyr Tyr Val Asn Gly Ser Gly Val Leu Ala Val Leu 
35 40 45 

gtg cga ttc tgt gac ttg acc tac gag etc ttt aac tac ttc gtt teg 192 
Val Arg Phe Cys Asp Leu Thr Tyr Glu Leu Phe Asn Tyr Phe Val Ser 
50 55 60 

gta cac ata get ggc ctg tac ate tgc acc ate tac ate aac tat ggg 240 
Val His He Ala Gly Leu Tyr He Cys Thr He Tyr He Asn Tyr Gly 
65 70 75 80 

caa ggc gat ttg gac ttc ttc gtg aac tgt ttg ata caa acc att att 288 
Gin Gly Asp Leu Asp Phe Phe Val Asn Cys Leu He Gin Thr He He 
85 90 95 

tat ctg tgg aca ata gcg atg aaa etc tac ttt egg agg ttc aga cct 336 
Tyr Leu Trp Thr He Ala Met Lys Leu Tyr Phe Arg Arg Phe Arg Pro 
100 105 110 
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ggt ttg ttg aat acc att ctg tec aac ate aat gat gag tac gag aca 
Gly Leu Leu Asn Thr lie Leu Ser Asn lie Asn Asp Glu Tyr Glu Thr 
115 120 125 

cgt teg get gtg gga ttc agt ttc gtc aca atg gcg gga tec tat egg 
Arg Ser Ala Val Gly Phe Ser Phe Val Thr Met Ala Gly Ser Tyr Arg 
130 135 140 

atg tec aag eta tgg ate aaa acc tat gtg tat tgc tgc tac ata ggc 
Met Ser Lys Leu Trp lie Lys Thr Tyr Val Tyr Cys Cys Tyr lie Gly 
145 150 155 160 

acc att ttc tgg ctg get ctt ccc att gec tac egg gat agg agt ctt 
Thr lie Phe Trp Leu Ala Leu Pro lie Ala Tyr Arg Asp Arg Ser Leu 
165 170 175 

cct ctt gec tgc tgg tat ccc ttt gac tat aca caa ccc ggt gtc tat 
Pro Leu Ala Cys Trp Tyr Pro Phe Asp Tyr Thr Gin Pro Gly Val Tyr 
180 185 190 

gag gta gtg ttc ctt etc cag gcg atg gga cag ate caa gtg gec gca 
Glu Val Val Phe Leu Leu Gin Ala Met Gly Gin lie Gin Val Ala Ala 
195 200 205 

tec ttt gee tec tec agt ggc ctg cat atg gtg ctt tgt gtg ctg ata 
Ser Phe Ala Ser Ser Ser Gly Leu His Met Val Leu Cys Val Leu lie 
210 215 220 

tea ggg cag tac gat gtc etc ttt tgc agt etc aag aat gta tta gec 
Ser Gly Gin Tyr Asp Val Leu Phe Cys Ser Leu Lys Asn Val Leu Ala 
225 230 235 240 

age age tat gtc ctt atg gga gee aat atg acg gaa ctg aat caa ttg 
Ser Ser Tyr Val Leu Met Gly Ala Asn Met Thr Glu Leu Asn Gin Leu 
245 250 255 

cag get gag caa tct gcg gec gat gtc gag cca ggt cag tat get tac 
Gin Ala Glu Gin Ser Ala Ala Asp Val Glu Pro Gly Gin Tyr Ala Tyr 
260 265 270 

tec gtg gag gag gag aca cct ttg caa gaa ctt eta aaa gtt ggg age 
Ser Val Glu Glu Glu Thr Pro Leu Gin Glu Leu Leu Lys Val Gly Ser 
275 280 285 

tea atg gac ttc tec tec gca ttc agg ctg tct ttt gtg egg tgc att 
Ser Met Asp Phe Ser Ser Ala Phe Arg Leu Ser Phe Val Arg Cys He 
290 295 300 



184 



HflUIJUi 



cag cac cat cga tac ata gtg gcg gca ctg aag aaa att gag agt ttc 960 
Gin His His Arg Tyr lie Val Ala Ala Leu Lys Lys lie Glu Ser Phe 
305 310 315 320 

tac agt ccc ata tgg ttc gtg aag att ggc gaa gtc acc ttt ctt atg 1008 
Tyr Ser Pro lie Trp Phe Val Lys He Gly Glu Val Thr Phe Leu Met 
325 330 335 

tgc ctg gta gcc ttc gtc tec acg aag age acc gcg gec aac tea ttc 1056 
Cys Leu Val Ala Phe Val Ser Thr Lys Ser Thr Ala Ala Asn Ser Phe 
340 345 350 

atg cga atg gtc tec ttg ggc cag tac ctg etc tta gtt etc tac gag 1104 
Met Arg Met Val Ser Leu Gly Gin Tyr Leu Leu Leu Val Leu Tyr Glu 
355 360 365 

ctg ttc ate ate tgc tac ttc gcg gac ate gtt ttt cag aac age cag 1152 
Leu Phe He lie Cys Tyr Phe Ala Asp He Val Phe Gin Asn Ser Gin 
370 375 380 

egg tgc ggt gaa gcc etc tgg cga agt cct tgg cag cga cat ttg aag 1200 
Arg Cys Gly Glu Ala Leu Trp Arg Ser Pro Trp Gin Arg His Leu Lys 
385 390 395 400 

gat gtt cgc agt gat tac atg ttc ttt atg ctg aat tec cgc agg cag 1248 
Asp Val Arg Ser Asp Tyr Met Phe Phe Met Leu Asn Ser Arg Arg Gin 
405 410 415 

ttc caa ctt acg gcc gga aaa ata age aat eta aac gtg gat cgt ttc 1296 
Phe Gin Leu Thr Ala Gly Lys He Ser Asn Leu Asn Val Asp Arg Phe 
420 425 430 

aga ggg act att act act gcc ttc teg ttt etc acc ttg ctg caa aag 1344 
Arg Gly Thr He Thr Thr Ala Phe Ser Phe Leu Thr Leu Leu Gin Lys 
435 440 445 

atg gat gca cga gaa 1359 
Met Asp Ala Arg Glu 
450 



<210> 92 
<211> 453 
<212> PRT 

<213> Drosophila melanogaster 



<400> 92 

Met Lys Ser Thr Phe Lys Glu Glu Arg He Lys Asp Asp Ser Lys Arg 



185 



Arg Asp Leu Phe Val Phe Val Arg Gin Thr Met Cys lie Ala Ala Met 

20 25 30 

Tyr Pro Phe Gly Tyr Tyr Val Asn Gly Ser Gly Val Leu Ala Val Leu 

35 40 45 

Val Arg Phe Cys Asp Leu Thr Tyr Glu Leu Phe Asn Tyr Phe Val Ser 



Val His He Ala Gly Leu Tyr He Cys Thr He Tyr He Asn Tyr Gly 
65 70 75 80 

Gin Gly Asp Leu Asp Phe Phe Val Asn Cys Leu He Gin Thr He He 
85 90 95 

Tyr Leu Trp Thr He Ala Met Lys Leu Tyr Phe Arg Arg Phe Arg Pro 
100 105 110 

Gly Leu Leu Asn Thr He Leu Ser Asn He Asn Asp Glu Tyr Glu Thr 
115 120 125 

Arg Ser Ala Val Gly Phe Ser Phe Val Thr Met Ala Gly Ser Tyr Arg 
130 135 140 

Met Ser Lys Leu Trp He Lys Thr Tyr Val Tyr Cys Cys Tyr He Gly 
145 150 155 160 

Thr He Phe Trp Leu Ala Leu Pro He Ala Tyr Arg Asp Arg Ser Leu 
165 170 175 

Pro Leu Ala Cys Trp Tyr Pro Phe Asp Tyr Thr Gin Pro Gly Val Tyr 
180 185 190 

Glu Val Val Phe Leu Leu Gin Ala Met Gly Gin He Gin Val Ala Ala 
195 200 205 

Ser Phe Ala Ser Ser Ser Gly Leu His Met Val Leu Cys Val Leu He 
210 215 220 

Ser Gly Gin Tyr Asp Val Leu Phe Cys Ser Leu Lys Asn Val Leu Ala 
225 230 235 240 

Ser Ser Tyr Val Leu Met Gly Ala Asn Met Thr Glu Leu Asn Gin Leu 
245 250 255 

Gin Ala Glu Gin Ser Ala Ala Asp Val Glu Pro Gly Gin Tyr Ala Tyr 



186 



260 



265 



270 



Ser Val Glu Glu Glu Thr Pro Leu Gin Glu Leu Leu Lys Val Gly Ser 
275 280 285 

Ser Met Asp Phe Ser Ser Ala Phe Arg Leu Ser Phe Val Arg Cys lie 
290 295 300 

Gin His His Arg Tyr lie Val Ala Ala Leu Lys Lys lie Glu Ser Phe 
305 310 315 320 

Tyr Ser Pro lie Trp Phe Val Lys lie Gly Glu Val Thr Phe Leu Met 
325 330 335 

Cys Leu Val Ala Phe Val Ser Thr Lys Ser Thr Ala Ala Asn Ser Phe 
340 345 350 

Met Arg Met Val Ser Leu Gly Gin Tyr Leu Leu Leu Val Leu Tyr Glu 
355 360 365 

Leu Phe lie lie Cys Tyr Phe Ala Asp He Val Phe Gin Asn Ser Gin 
370 375 380 

Arg Cys Gly Glu Ala Leu Trp Arg Ser Pro Trp Gin Arg His Leu Lys 
385 390 395 400 

Asp Val Arg Ser Asp Tyr Met Phe Phe Met Leu Asn Ser Arg Arg Gin 
405 410 415 

Phe Gin Leu Thr Ala Gly Lys He Ser Asn Leu Asn Val Asp Arg Phe 
420 425 430 

Arg Gly Thr He Thr Thr Ala Phe Ser Phe Leu Thr Leu Leu Gin Lys 
435 440 445 

Met Asp Ala Arg Glu 
450 



<210> 93 
<211> 1296 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1296) 



187 



<223> DORLU 26.1 



<400> 93 

atg aaa gtg ggt ttt gca act att ggc tat ata aag agt att cct tgc 48 
Met Lys Val Gly Phe Ala Thr He Gly Tyr lie Lys Ser lie Pro Cys 
15 10 15 



cag gat gtc gtt cac ata gtt ata tec ate atg tec gag tgg tta cgc 
Gin Asp Val Val His lie Val He Ser He Met Ser Glu Trp Leu Arg 



ttt ctg aaa cgc gat caa cag ctg gat gtg tac ttt ttt gca gtg ccc 144 
Phe Leu Lys Arg Asp Gin Gin Leu Asp Val Tyr Phe Phe Ala Val Pro 
35 40 45 



cgc ttg agt tta gac ata atg ggc tat tgg ccg ggc aaa act ggt gat 192 
Arg Leu Ser Leu Asp He Met Gly Tyr Trp Pro Gly Lys Thr Gly Asp 
50 55 60 



aca tgg ccc tgg aga tec ctg att cac ttc gca ate ctg gec att ggc 240 
Thr Trp Pro Trp Arg Ser Leu He His Phe Ala lie Leu Ala He Gly 
65 70 75 80 



gtg gec acc gaa ctg cat get ggc atg tgt ttt eta gac cga cag cag 288 
Val Ala Thr Glu Leu His Ala Gly Met Cys Phe Leu Asp Arg Gin Gin 
85 90 95 



att acc ttg gca ctg gag acc etc tgt cca get ggc aca teg gcg gtc 336 
He Thr Leu Ala Leu Glu Thr Leu Cys Pro Ala Gly Thr Ser Ala Val 
100 105 110 



acg ctg etc aag atg ttc eta atg ctg cgc ttt cgt cag gat etc tec 384 
Thr Leu Leu Lys Met Phe Leu Met Leu Arg Phe Arg Gin Asp Leu Ser 
115 120 125 



att atg tgg aac cgc ctg agg ggc ctg etc ttc gat ccc aac tgg gag 432 
He Met Trp Asn Arg Leu Arg Gly Leu Leu Phe Asp Pro Asn Trp Glu 
130 135 140 



cga ccc gag cag egg gac ate egg eta aag cac teg gec atg gcg get 4 80 
Arg Pro Glu Gin Arg Asp He Arg Leu Lys His Ser Ala Met Ala Ala 
145 150 155 160 



cgc ate aat ttc tgg ccc ctg tea gee gga ttc ttc aca tgc acc acc 
Arg He Asn Phe Trp Pro Leu Ser Ala Gly Phe Phe Thr Cys Thr Thr 
165 170 175 



tac aac eta aag ccg ata ctg ate gca atg ata ttg tat etc cag aat 576 
188 



Tyr Asn Leu 



cgt tac gag 
Arg Tyr Glu 
195 

aaa gtt ctg 
Lys Val Leu 
210 

gcc tat acg 
Ala Tyr Thr 
225 

ttt tat ttc 
Phe Tyr Phe 



cag gcg gag 
Gin Ala Glu 



ctg teg cca 
Leu Ser Pro 
275 

ate att agg 
lie lie Arg 
290 

cgc tat acc 
Arg Tyr Thr 
305 

att gga ttc 
He Gly Phe 



ggc gca atg 
Gly Ala Met 



ctg gtt tat 
Leu Val Tyr 
355 

ctg tgc cga 



Lys Pro He 
180 

gac ttc gtt 
Asp Phe Val 



eta aac tat 
Leu Asn Tyr 



ggc tat gtg 
Gly Tyr Val 
230 

gag ttc tgt 
Glu Phe Cys 
245 

ata gaa tea 
He Glu Ser 
260 

gtg cag ctt 
Val Gin Leu 



cac aat gcc 
His Asn Ala 



att att acc 
He He Thr 
310 

age atg gtt 
Ser Met Val 
325 

etc tat gtg 

Leu Tyr Val 
340 

tgc tat ggc 
Cys Tyr Gly 



gcc atg ttc 



Leu He Ala 

185 

tgg ttt aca 
Trp Phe Thr 
200 

cca ttt ttt 
Pro Phe Phe 
215 

acc ate ttt 
Thr He Phe 



gcc cac eta 
Ala His Leu 



atg ttt aga 
Met Phe Arg 
265 

tac att tta 
Tyr He Leu 
280 

ate ate gat 
He He Asp 
295 

ctg gcc cat 
Leu Ala His 



aat etc ctg 
Asn Leu Leu 



gcc tac acg 
Ala Tyr Thr 
345 

gga act ctg 
Gly Thr Leu 
360 

tec tgt ccg 



Met He Leu 



ccc ttc aat 
Pro Phe Asn 



ccc ctg acc 
Pro Leu Thr 
220 

atg ttc ggc 
Met Phe Gly 
235 

tea get ctt 
Ser Ala Leu 
250 

ccc tac act 
Pro Tyr Thr 



gag caa aag 
Glu Gin Lys 



ttg acc aga 
Leu Thr Arg 
300 

ttt gtg tec 
Phe Val Ser 
315 

aca ttg ggc 
Thr Leu Gly 
330 

gtt gcc get 

Val Ala Ala 



gtg gcc gaa 
Val Ala Glu 



tgg cag ctt 



Tyr Leu Gin 
190 

atg act atg 
Met Thr Met 
205 

tac ata ttt 
Tyr He Phe 



ggc tgt gat 
Gly Cys Asp 



ttc gaa gtg 
Phe Glu Val 
255 

gat cac ttg 
Asp His Leu 
270 

atg cga tea 
Met Arg Ser 
285 

ttt ttt cgt 
Phe Phe Arg 



gcc gcc atg 
Ala Ala Met 



aat aat ggt 
Asn Asn Gly 
335 

ttg age caa 
Leu Ser Gin 
350 

agt age act 
Ser Ser Thr 
365 

ttt aag cct 



Asn 



ccc 624 
Pro 



att 672 
He 



ggt 720 

Gly 

240 

etc 768 
Leu 



gaa 816 
Glu 



gta 864 
Val 



gat 912 
Asp 



gtg 960 

Val 

320 

ctg 1008 
Leu 



ctg 1056 
Leu 



ggt 1104 
Gly 



aaa 1152 



189 



Leu Cys Arg Ala Met Phe Ser Cys Pro Trp Gin Leu Phe Lys Pro Lys 
370 375 380 



caa cgt cga etc gtt cag ctt ttg att etc aga teg cag cgt cct gtt 1200 
Gin Arg Arg Leu Val Gin Leu Leu lie Leu Arg Ser Gin Arg Pro Val 
385 390 395 400 

tec atg gca gtg cca ttc ttt teg cca teg ttg get acc ttt get gcg 1248 
Ser Met Ala Val Pro Phe Phe Ser Pro Ser Leu Ala Thr Phe Ala Ala 
405 410 415 

att ctt caa act teg ggt tec ata att gcg ctg gtt aag tec ttt cag 1296 
lie Leu Gin Thr Ser Gly Ser lie lie Ala Leu Val Lys Ser Phe Gin 
420 425 430 



<210> 94 
<211> 432 
<212> PRT 

<213> Drosophila melanogaster 
<400> 94 

Met Lys Val Gly Phe Ala Thr lie Gly Tyr lie Lys Ser He Pro Cys 



Gin Asp Val Val His He Val He Ser He Met Ser Glu Trp Leu Arg 



Phe Leu Lys Arg Asp Gin Gin Leu Asp Val Tyr Phe Phe Ala Val Pro 
35 40 45 

Arg Leu Ser Leu Asp He Met Gly Tyr Trp Pro Gly Lys Thr Gly Asp 



Thr Trp Pro Trp Arg Ser Leu He His Phe Ala He Leu Ala He Gly 



Val Ala Thr Glu Leu His Ala Gly Met Cys Phe Leu Asp Arg Gin Gin 
85 90 95 

lie Thr Leu Ala Leu Glu Thr Leu Cys Pro Ala Gly Thr Ser Ala Val 
100 105 110 

Thr Leu Leu Lys Met Phe Leu Met Leu Arg Phe Arg Gin Asp Leu Ser 
115 120 125 

He Met Trp Asn Arg Leu Arg Gly Leu Leu Phe Asp Pro Asn Trp Glu 
130 135 140 



m 



Arg Pro Glu Gin Arg Asp lie Arg Leu Lys His Ser Ala Met Ala Ala 
145 150 155 160 



Arg lie Asn Phe Trp 
165 

Tyr Asn Leu Lys Pro 

180 

Arg Tyr Glu Asp Phe 
195 

Lys Val Leu Leu Asn 
210 

Ala Tyr Thr Gly Tyr 
225 

Phe Tyr Phe Glu Phe 
245 

Gin Ala Glu He Glu 
260 

Leu Ser Pro Val Gin 
275 

lie He Arg His Asn 
290 

Arg Tyr Thr He He 

305 

He Gly Phe Ser Met 
325 

Gly Ala Met Leu Tyr 
340 

Leu Val Tyr Cys Tyr 
355 

Leu Cys Arg Ala Met 
370 

Gin Arg Arg Leu Val 
385 



Pro Leu Ser Ala Gly Phe 
170 

He Leu He Ala Met He 
185 

Val Trp Phe Thr Pro Phe 
200 

Tyr Pro Phe Phe Pro Leu 
215 

Val Thr He Phe Met Phe 
230 235 

Cys Ala His Leu Ser Ala 
250 

Ser Met Phe Arg Pro Tyr 
265 

Leu Tyr He Leu Glu Gin 
280 

Ala He He Asp Leu Thr 
295 

Thr Leu Ala His Phe Val 
310 315 

Val Asn Leu Leu Thr Leu 

330 

Val Ala Tyr Thr Val Ala 
345 

Gly Gly Thr Leu Val Ala 
360 

Phe Ser Cys Pro Trp Gin 
375 

Gin Leu Leu He Leu Arg 
390 395 



Phe Thr Cys Thr Thr 
175 

Leu Tyr Leu Gin Asn 
190 

Asn Met Thr Met Pro 
205 

Thr Tyr He Phe He 
220 

Gly Gly Cys Asp Gly 
240 

Leu Phe Glu Val Leu 
255 

Thr Asp His Leu Glu 
270 

Lys Met Arg Ser Val 
285 

Arg Phe Phe Arg Asp 
300 

Ser Ala Ala Met Val 
320 

Gly Asn Asn Gly Leu 
335 

Ala Leu Ser Gin Leu 
350 

Glu Ser Ser Thr Gly 
365 

Leu Phe Lys Pro Lys 
380 

Ser Gin Arg Pro Val 
400 



191 



Ser Met Ala Val Pro Phe Phe Ser Pro Ser Leu Ala Thr Phe Ala Ala 
405 410 415 

lie Leu Gin Thr Ser Gly Ser lie lie Ala Leu Val Lys Ser Phe Gin 
420 425 430 



<210> 95 
<211> 1047 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (1) . . (1047) 

<223> DORLU 27.1 



<400> 95 

atg tct ggt tgc aga gca atg gcc tta 
Met Ser Gly Cys Arg Ala Met Ala Leu 
1 5 



ttt aca acc aca gaa gaa cgt 48 
Phe Thr Thr Thr Glu Glu Arg 
10 15 



ctg ctg ccc tac cga tct aaa tgg cac acc ttg gta tat att caa atg 
Leu Leu Pro Tyr Arg Ser Lys Trp His Thr Leu Val Tyr lie Gin Met 



gtt ata ttt ttt get tea atg age ttt ggc tta acg gaa teg atg gga 144 
Val lie Phe Phe Ala Ser Met Ser Phe Gly Leu Thr Glu Ser Met Gly 
35 40 45 



gac cat gtt caa atg gga egg gac tta gcc ttc ate ctt ggg aca tat 192 
Asp His Val Gin Met Gly Arg Asp Leu Ala Phe lie Leu Gly Thr Tyr 
50 55 60 



tat ttc tgc tgg tat ggc gat gaa ctt gac caa gtg ate age gat ctg 240 
Tyr Phe Cys Trp Tyr Gly Asp Glu Leu Asp Gin Val lie Ser Asp Leu 
65 70 75 80 



gac get eta cat cct tgg gca cag aaa ggt cct aat cca gtt gaa tat 288 
Asp Ala Leu His Pro Trp Ala Gin Lys Gly Pro Asn Pro Val Glu Tyr 
85 90 95 



cag act ggt aaa cgt tgg tac ttc gta atg get ttt ttc ttg gca acg 336 
Gin Thr Gly Lys Arg Trp Tyr Phe Val Met Ala Phe Phe Leu Ala Thr 
100 105 110 



192 



tea tgg teg ttc ttc ttg tgc att ttg eta ttg tta ctt ata acc tea 
Ser Trp Ser Phe Phe Leu Cys lie Leu Leu Leu Leu Leu lie Thr Ser 
115 120 125 

ccc atg tgg gtc cat cag caa aac ctt ccc ttt cat gcg gcg ttt cct 
Pro Met Trp Val His Gin Gin Asn Leu Pro Phe His Ala Ala Phe Pro 
130 135 140 

ttt caa tgg cac gaa aaa teg ctt cat ccc ate age cac get ata ate 
Phe Gin Trp His Glu Lys Ser Leu His Pro lie Ser His Ala lie lie 
145 150 155 160 

tat ctg ttt cag age tat ttt gca gtg tat tgt ctg act tgg ctt ttg 
Tyr Leu Phe Gin Ser Tyr Phe Ala Val Tyr Cys Leu Thr Trp Leu Leu 
165 170 175 

tgc ata gag gga eta tea att tgt att tat gcg gaa att act ttc ggc 
Cys He Glu Gly Leu Ser He Cys He Tyr Ala Glu lie Thr Phe Gly 
180 185 190 

att gaa gtt tta tgc eta gaa eta cgc caa att cac cga cac aat tat 
He Glu Val Leu Cys Leu Glu Leu Arg Gin He His Arg His Asn Tyr 
195 200 205 

ggc ctt caa gaa ctg aga atg gag acg aac cgc ttg gtc aag eta cat 
Gly Leu Gin Glu Leu Arg Met Glu Thr Asn Arg Leu Val Lys Leu His 
210 215 220 

cag aag att atg ggt gtt aac ttt tec ttg gtg tec ttg teg gtt ttg 
Gin Lys He Met Gly Val Asn Phe Ser Leu Val Ser Leu Ser Val Leu 
225 230 235 240 

gag gec gtg gag get egg aag gac ccc aaa gtt gtg gee cag ttt gca 
Glu Ala Val Glu Ala Arg Lys Asp Pro Lys Val Val Ala Gin Phe Ala 
245 250 255 

gtc ctt atg ttg etc gec tta gga cat eta tct atg tgg teg tat tgt 
Val Leu Met Leu Leu Ala Leu Gly His Leu Ser Met Trp Ser Tyr Cys 
260 265 270 

gga gac cag tta tec cag aag tea ttg caa att teg gag get gec tat 
Gly Asp Gin Leu Ser Gin Lys Ser Leu Gin He Ser Glu Ala Ala Tyr 
275 280 285 

gag get tac gac cca acc aaa gga tea aag gat gtg tat aga gac etc 
Glu Ala Tyr Asp Pro Thr Lys Gly Ser Lys Asp Val Tyr Arg Asp Leu 
290 295 300 
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tgc gta ata ate agg cgt ggc cag gac cct ttg ate atg aga gec age 960 
Cys Val lie lie Arg Arg Gly Gin Asp Pro Leu lie Met Arg Ala Ser 
305 310 315 320 

cca ttt ccg tec ttt aat tta ata aac tac age get ata ctt aac caa 1008 
Pro Phe Pro Ser Phe Asn Leu lie Asn Tyr Ser Ala lie Leu Asn Gin 
325 330 335 

tgt tat gga ate ctg aca ttt ttg eta aag aca tta gac 1047 
Cys Tyr Gly lie Leu Thr Phe Leu Leu Lys Thr Leu Asp 
340 345 



<210> 96 
<211> 349 
<212> PRT 

<213> Drosophila melanogaster 
<400> 96 

Met Ser Gly Cys Arg Ala Met Ala Leu Phe Thr Thr Thr Glu Glu Arg 



Leu Leu Pro Tyr Arg Ser Lys Trp His Thr Leu Val Tyr lie Gin Met 



Val lie Phe Phe Ala Ser Met Ser Phe Gly Leu Thr Glu Ser Met Gly 
35 40 45 

Asp His Val Gin Met Gly Arg Asp Leu Ala Phe lie Leu Gly Thr Tyr 



Tyr Phe Cys Trp Tyr Gly Asp Glu Leu Asp Gin Val lie Ser Asp Leu 

65 70 75 80 

Asp Ala Leu His Pro Trp Ala Gin Lys Gly Pro Asn Pro Val Glu Tyr 



Gin Thr Gly Lys Arg Trp Tyr Phe Val Met Ala Phe Phe Leu Ala Thr 
100 105 110 

Ser Trp Ser Phe Phe Leu Cys lie Leu Leu Leu Leu Leu lie Thr Ser 
115 120 125 

Pro Met Trp Val His Gin Gin Asn Leu Pro Phe His Ala Ala Phe Pro 
130 135 140 

Phe Gin Trp His Glu Lys Ser Leu His Pro lie Ser His Ala lie lie 
145 150 155 160 



m 



Tyr Leu Phe Gin Ser Tyr Phe Ala Val Tyr Cys Leu Thr Trp Leu Leu 
165 170 175 



Cys lie Glu Gly Leu Ser He Cys He Tyr Ala Glu He Thr Phe Gly 
180 185 190 

lie Glu Val Leu Cys Leu Glu Leu Arg Gin He His Arg His Asn Tyr 
195 200 205 

Gly Leu Gin Glu Leu Arg Met Glu Thr Asn Arg Leu Val Lys Leu His 
210 215 220 

Gin Lys He Met Gly Val Asn Phe Ser Leu Val Ser Leu Ser Val Leu 
225 230 235 240 

Glu Ala Val Glu Ala Arg Lys Asp Pro Lys Val Val Ala Gin Phe Ala 
245 250 255 

Val Leu Met Leu Leu Ala Leu Gly His Leu Ser Met Trp Ser Tyr Cys 
260 265 270 

Gly Asp Gin Leu Ser Gin Lys Ser Leu Gin He Ser Glu Ala Ala Tyr 
275 280 285 

Glu Ala Tyr Asp Pro Thr Lys Gly Ser Lys Asp Val Tyr Arg Asp Leu 
290 295 300 

Cys Val lie lie Arg Arg Gly Gin Asp Pro Leu lie Met Arg Ala Ser 
305 310 315 320 

Pro Phe Pro Ser Phe Asn Leu He Asn Tyr Ser Ala lie Leu Asn Gin 
325 330 335 

Cys Tyr Gly lie Leu Thr Phe Leu Leu Lys Thr Leu Asp 
340 345 



<210> 97 
<211> 1185 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (1185) 

<223> DORLU 28.1 

195 



k. U i m \ ULiii juui i mi 1 1 1 ii mil .mm 



<400> 97 

atg gaa ttg aaa teg atg gat ccg gtg gag atg ccc att ttt ggt age 

Met Glu Leu Lys Ser Met Asp Pro Val Glu Met Pro lie Phe Gly Ser 
15 10 15 

act ctg aag eta atg aag ttc tgg tea tat ctg ttt gtt cac aac tgg 
Thr Leu Lys Leu Met Lys Phe Trp Ser Tyr Leu Phe Val His Asn Trp 



cgc cgc tat gtc gca atg act ccg tac ate att ate aac tgt act cag 
Arg Arg Tyr Val Ala Met Thr Pro Tyr lie lie lie Asn Cys Thr Gin 



tat gtg gat ata tat ctg age ace gaa tec ttg gac ttt ate ate aga 
Tyr Val Asp lie Tyr Leu Ser Thr Glu Ser Leu Asp Phe lie lie Arg 



aat gta tac ctg get gta ttg ttt ace aac acg gtg gtc aga ggt gta 
Asn Val Tyr Leu Ala Val Leu Phe Thr Asn Thr Val Val Arg Gly Val 



ttg tta tgc gta cag egg ttt age tac gag cgt ttc att aat att ttg 
Leu Leu Cys Val Gin Arg Phe Ser Tyr Glu Arg Phe lie Asn lie Leu 
85 90 95 

aaa age ttt tac att gag ttg ttg caa tea gat gac ccc ate ata aac 
Lys Ser Phe Tyr lie Glu Leu Leu Gin Ser Asp Asp Pro lie lie Asn 
100 105 110 

att ttg gtc aag gaa acc aca cgc eta tea gtt tta att agt agg att 
lie Leu Val Lys Glu Thr Thr Arg Leu Ser Val Leu lie Ser Arg lie 
115 120 125 

aat tta tta atg ggc tgc tgc act tgc att ggc ttt gtt aca tat ccc 
Asn Leu Leu Met Gly Cys Cys Thr Cys lie Gly Phe Val Thr Tyr Pro 
130 135 140 

att ttt ggt teg gaa aga gtt ctg cca tat ggc atg tat ttg ccc act 
lie Phe Gly Ser Glu Arg Val Leu Pro Tyr Gly Met Tyr Leu Pro Thr 
145 150 155 160 

att gat gaa tac aaa tac gca tea cct tac tac gag att ttc ttt gtg 
lie Asp Glu Tyr Lys Tyr Ala Ser Pro Tyr Tyr Glu lie Phe Phe Val 
165 170 175 

att caa gec att atg get cca atg ggg tgt tgc atg tac ata cca tac 
He Gin Ala He Met Ala Pro Met Gly Cys Cys Met Tyr He Pro Tyr 



196 



180 



185 



190 



aca aac atg gta gtg aca ttt acc ctt ttc gcc att etc atg tgt cga 624 
Thr Asn Met Val Val Thr Phe Thr Leu Phe Ala lie Leu Met Cys Arg 
195 200 205 

gtg ttg caa cat aag ttg aga age eta gaa aag ctg aaa aat gaa caa 672 
Val Leu Gin His Lys Leu Arg Ser Leu Glu Lys Leu Lys Asn Glu Gin 
210 215 220 

gta cgt ggt gaa ate ata tgg tgc ata aaa tat caa tta aaa tta tea 720 
Val Arg Gly Glu He He Trp Cys He Lys Tyr Gin Leu Lys Leu Ser 
225 230 235 240 

gga ttt gtt gat tea atg aat gcc ttg aac acc cat ctt cat ttg gtg 768 
Gly Phe Val Asp Ser Met Asn Ala Leu Asn Thr His Leu His Leu Val 
245 250 255 

gag ttc ctt tgc ttt ggt gcc atg eta tgt gtt ctt ctt ttc tec tta 816 
Glu Phe Leu Cys Phe Gly Ala Met Leu Cys Val Leu Leu Phe Ser Leu 
260 265 270 

ata att get caa aca att get cag acc gtc ata gtc ate gca tac atg 864 
He He Ala Gin Thr He Ala Gin Thr Val He Val He Ala Tyr Met 
275 280 285 

gta atg ata ttt gcc aac agt gta gtc ctt tac tac gtg gcc aat gag 912 
Val Met He Phe Ala Asn Ser Val Val Leu Tyr Tyr Val Ala Asn Glu 
290 295 300 

eta tac ttt caa gta aga gtt gtc caa ttt tct ttt aaa ttt ttg tat 960 
Leu Tyr Phe Gin Val Arg Val Val Gin Phe Ser Phe Lys Phe Leu Tyr 
305 310 315 320 

aag tat ggg att ttg cag age ttt gat att gcc att get gcc tat gag 1008 
Lys Tyr Gly He Leu Gin Ser Phe Asp He Ala He Ala Ala Tyr Glu 
325 330 335 

age aat tgg atg gac ttt gat gtg gac aca caa aag act ttg aag ttc 1056 
Ser Asn Trp Met Asp Phe Asp Val Asp Thr Gin Lys Thr Leu Lys Phe 
340 345 350 

etc ate atg cgc teg caa aag ccc ttg gcg act ctg gtg ggt ggc aca 1104 
Leu He Met Arg Ser Gin Lys Pro Leu Ala Thr Leu Val Gly Gly Thr 
355 360 365 

tat ccc atg aac ttg aaa atg ctt cag tea eta eta aat gcc att tac 1152 
Tyr Pro Met Asn Leu Lys Met Leu Gin Ser Leu Leu Asn Ala He Tyr 
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370 375 : 

tec ttc ttc acc ctt ctg cgt cgc gtt tac ggc 

Ser Phe Phe Thr Leu Leu Arg Arg Val Tyr Gly 
385 390 395 



<210> 98 
<211> 395 
<212> PRT 

<213> Drosophila melanogaster 



<400> 98 

Met Glu Leu Lys Ser Met Asp Pro Val Glu Met Pro lie Phe Gly Ser 



Thr Leu Lys Leu Met Lys Phe Trp Ser Tyr Leu Phe Val His Asn Trp 

20 25 30 

Arg Arg Tyr Val Ala Met Thr Pro Tyr lie lie lie Asn Cys Thr Gin 

35 40 45 

Tyr Val Asp lie Tyr Leu Ser Thr Glu Ser Leu Asp Phe lie lie Arg 



Asn Val Tyr Leu Ala Val Leu Phe Thr Asn Thr Val Val Arg Gly Val 

65 70 75 80 

Leu Leu Cys Val Gin Arg Phe Ser Tyr Glu Arg Phe lie Asn lie Leu 
85 90 95 

Lys Ser Phe Tyr lie Glu Leu Leu Gin Ser Asp Asp Pro lie lie Asn 
100 105 110 

He Leu Val Lys Glu Thr Thr Arg Leu Ser Val Leu He Ser Arg He 
115 120 125 

Asn Leu Leu Met Gly Cys Cys Thr Cys He Gly Phe Val Thr Tyr Pro 
130 135 140 

He Phe Gly Ser Glu Arg Val Leu Pro Tyr Gly Met Tyr Leu Pro Thr 
145 150 155 160 

He Asp Glu Tyr Lys Tyr Ala Ser Pro Tyr Tyr Glu He Phe Phe Val 
165 170 175 

He Gin Ala He Met Ala Pro Met Gly Cys Cys Met Tyr He Pro Tyr 
180 185 190 
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Thr Asn Met Val Val Thr Phe Thr Leu Phe Ala lie Leu Met Cys Arg 
195 200 205 



Val Leu Gin His Lys 
210 

Val Arg Gly Glu lie 
225 

Gly Phe Val Asp Ser 
245 

Glu Phe Leu Cys Phe 
260 

He He Ala Gin Thr 
275 

Val Met He Phe Ala 
290 

Leu Tyr Phe Gin Val 
305 

Lys Tyr Gly He Leu 
325 

Ser Asn Trp Met Asp 
340 

Leu He Met Arg Ser 
355 

Tyr Pro Met Asn Leu 
370 

Ser Phe Phe Thr Leu 
385 



Leu Arg Ser Leu Glu Lys 
215 

lie Trp Cys He Lys Tyr 
230 235 

Met Asn Ala Leu Asn Thr 
250 

Gly Ala Met Leu Cys Val 
265 

He Ala Gin Thr Val He 
280 

Asn Ser Val Val Leu Tyr 
295 

Arg Val Val Gin Phe Ser 
310 315 

Gin Ser Phe Asp He Ala 
330 

Phe Asp Val Asp Thr Gin 
345 

Gin Lys Pro Leu Ala Thr 
360 

Lys Met Leu Gin Ser Leu 
375 

Leu Arg Arg Val Tyr Gly 
390 395 



Leu Lys Asn Glu Gin 
220 

Gin Leu Lys Leu Ser 
240 

His Leu His Leu Val 
255 

Leu Leu Phe Ser Leu 
270 

Val He Ala Tyr Met 
285 

Tyr Val Ala Asn Glu 
300 

Phe Lys Phe Leu Tyr 
320 

He Ala Ala Tyr Glu 
335 

Lys Thr Leu Lys Phe 
350 

Leu Val Gly Gly Thr 
365 

Leu Asn Ala He Tyr 
380 



<210> 99 
<211> 1579 
<212> DNA 

<213> Drosophila melanogaster 
<220> 
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<223> DOR 22A.2, NCBI Accession No. AF127924 



<220> 

<221> introh 
<222> (1) . . (122) 

<220> 

<221> intron 
<222> (251) . . (310) 

<220> 

<221> intron 

<222> (950) . . (1579) 



<400> 99 

cgaaacaaag cttaagttct tgaaatcgcc aagtcatgtg agaccatttg caacagtgag 60 
gcggcctata aaaagcactg ttttccagct caagcaaata atcaaaaaga gccacggacg 120 
aaatgttaag ccagttcttt ccccacatta aagaaaagcc attgagcgag cgggttaagt 180 
cccgagatgc cttcgtttac ttagatcggg tgatgtggtc ctttggctgg acagtgcctg 240 
aaaacaaaag gtaatgaaat cgatctaagc acatatataa gtattaaagc aaccataatt 300 
tcacgataag gtgggatcta cattacaaac tgtggtcaac tttcgtgaca ttgttgatat 360 
ttatccttct gccgatatcg gtaagcgttg agtatattca gcggttcaag accttctcgg 420 
cgggtgagtt tcttagctca atccagattg gcgttaacat gtacggaagc agctttaaaa 480 
gttatttgac catgatggga tataagaaga gacaggaggc taagatgtca ctggatgagc 54 0 
tggacaagag atgcgtttgt gatgaggaga ggaccattgt acatcgacat gtcgccctgg 600 
gaaacttttg ctatattttc tatcacattg cgtacactag ctttttgatt tcaaactttt 660 
tgtcatttat aatgaagaga atccatgcct ggcgcatgta ctttccctac gtcgaccccg 720 
aaaagcaatt ttacatctct agcatcgccg aagtcattct taggggatgg gccgtcttca 780 
tggatctctg cacggatgtg tgtcctttga tctccatggt aatagcacga tgccacatca 840 
cccttctgaa acagcgcctg cgaaatctac gatcggaacc aggaaggacg gaagatgagt 900 
acttgaagga gctcgccgac tgcgttcgag atcaccgctt gatattggag taagctgttt 960 
aaaaggactt accgtcatca atcttaaata taaatactac cgggttgttt tctatcaatg 1020 
aaatttgttg cacttactgt aatttacaat ttgttttgac aaattttttt atttacctta 1080 
ctttttcagc tatgtcgacg cattgcgatc cgtcttttcg gggacaattt ttgtgcagtt 1140 
cctcttgatc ggtattgtac tgggtctgtc aatgataaat ataatgtttt tctcaacact 1200 
ttcgactggt gtcgccgttg tcctttttat gtcctgcgta tctatgcaga cgttcccctt 12 60 
ttgctatttg tgtaacatga ttatggatga ctgccaagag atggccgact ccctttttca 1320 
atcggactgg acatctgccg atcgtcgcta caaatccact ttggtatact ttcttcacaa 1380 
tcttcagcag cccattattc ttacggctgg tggagtcttt cctatttcca tgcaaacaaa 1440 
tttaaatgtg agtatcaatc aatatgagct ataatgtatg tatgagatta aaattactat 1500 
tcttttatag atggtgaagc tggcctttac tgtggttaca atagtgaaac aatttaactt 1560 
ggcagaaaag tttcaataa 157 9 



<210> 100 
<211> 3500 
<212> DNA 

<213> Drosophila melanogaster 
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<220> 

<223> DOR 22C.1, segment of BDGP Clone No. AC004716 



<220> 

<221> intron 
<222> (1) . . (121) 



<220> 

<221> intron 
<222> (454) . . (516) 

<220> 

<221> intron 
<222> (646) . . (859) 



<220> 

<221> intron 

<222> (1051) . . (1137) 

<220> 

<221> intron 

<222> (1320) . . (2566) 

<220> 

<221> intron 

<222> (2669) . . (3205) 



<220> 

<221> intron 

<222> (3359) . . (3449) 



<400> 100 

tagttccatc gagtgcctat aaaagcggtg gccaacattg cgtctgacac ttgcccggag 60 
cagctgacca ctggagttgc ggtaggagca tcgatagtat ccacttgagt tcatcccaga 120 
aatgactgac agcgggcagc ctgccattgc cgaccacttt tatcggattc cccgcatctc 180 
cggcctcatt gtcggcctct ggccgcaaag gataaggggc gggggcggtc gtccttggca 240 
cgcccatctg ctcttcgtgt tcgccttcgc catggtggtg gtgggtgcgg tgggcgaggt 300 
gtcgtacggc tgtgtccacc tggacaacct ggtggtggcg ctggaggcct tctgccccgg 3 60 
aaccaccaag gcggtctgcg ttttgaagct gtgggtcttc ttccgctcca atcgccggtg 420 
ggcggagttg gtccagcgcc tgcgggctat tttgtgggaa tcgcggcggc aggaggccca 480 
gaggatgctg gtcggactgg ccaccacggc caacaggctc agcctgttgt tgctcagctc 540 
tggcacggcg acaaatgccg ccttcacctt gcaaccgctg attatgggtc tctaccgctg 600 
gattgtgcag ctgccaggtc aaaccgagct gccctttaat atcatgtgag taagctagtt 660 
tgtggtttct aacgctattt cgtgcaactt tattacttac gtaagcatcg cgtatccgtt 720 
aagatgaaaa gttcctcaac caatttggtt gatggcaggt gatacataag taatgattcc 780 
aattgccccc gccagaagca cttaagatac gccatccgcc atccgccatt cgccattaat 840 
cgcagttcct tttaaacaga ctgccctcgt ttgccgtgca gccaggagtc tttccgctca 900 
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cctacgtgct gctgaccgct tccggtgcct gcaccgtttt cgccttcagc ttcgtggacg 960 
gattcttcat ttgctcgtgc ctctacatct gcggcgcttt ccggctggtg cagcaggaca 1020 
ttcgcaggat atttgccgat ttgcatggcg gtgggtctgg ctctgctgcg gcacgatgac 1080 
aatggttttc cctcgctcga tgactttcct tggctttcct ttcccttgca gactcagtgg 1140 
atgtgttcac cgaggagatg aacgcggagg tgcggcacag actggcccaa gttgtcgagc 1200 
ggcacaatgc gattatcgat ttctgcacgg acctaacacg ccagttcacc gttatcgttt 1260 
taatgcattt cctgtccgcc gccttcgtcc tctgctcgac catcctggac atcatgttgg 1320 
tgagcccctt ttcagaggcc ttcctttggg gcgggtatcc ttggggttaa tcaaaaaact 1380 
gttgcaacca ggagtggtac caatatcaaa gcccccttta atattttgcc agtttgtcgc 1440 
gccactggct tttcgcatcg cctgcattcg gcggctgttt taaaagtttt tccctgtttt 1500 
cactgtttgc tgtttttccc tggcttttcc agccgctccg ttctgattcg gttttcccga 1560 
tttgtttgtt tgctttgtgg ctgcggctgc ggctctctcc ggtggcaatt tataagcgca 1620 
tgaattgtta ctgctcccca aaaagttttt aatgcgactt tcgcacgttt tccggtgtcc 1680 
cggacaccgg agacccataa gctgtttgct gcaactggcg aacggagtgg aaatgttcgg 17 4 0 
gaaaggctca tggctcctgg cttctggccc caatgggata aatgctgcca gcaatttctt 1800 
agcccttctg ccgcttcttt ttcggcccaa agtggtgttg agtaatccat tcgaaacttg 1860 
tttatcgctc ggggatcctg cgaccagcga tctattatca ctttttaagt ctgttcgtga 1920 
atggagggct gggaagacct aaaggcacac tgagttttga cacgtgctcg ataagagaaa 198 0 
tatgaataca aagtttgtac ttttccgaag ctatttttat ttgagcaaac ttgtatatcg 2040 
aaaactcatg gttaatagaa atcagtaata tacagtagca gtcccttata aaccttcctt 2100 
acttgaagtt agcaaccttc aagctaatgc caaaacacaa ttaaaataag ctcagaagaa 2160 
gcggaagagc ggaaatgcaa tttattgtaa gccaatcctg ttggggactt ataaaatgga 2220 
gcattgtcac agttttaggc actcgacgac atgcataaga tgcataattc tttttcaatg 2280 
ccacaatggg aggaatgtca ggcgagtgga aaatgggaaa acgagaggaa aagcaatcag 234 0 
actgcagacc atttgccaaa gtgcatttct ctgatgtttt ccattttcgc gactacttca 2400 
agcccgagca cgccacataa aacgacggta tatatacact tatattatat attagataat 2460 
cccctgcact ttttcaccac cagcaatttc gctccaatta atgacgtccc ccccccacac 2520 
ttctcacttt ccgcaccctc caattcttcg tcttccgtct ccgcagaaca cgtcgtcgtt 2580 
gagcggctta acctacatct gctatatcat cgcggcccta acgcagctat tcctctactg 264 0 
cttcggaggc aatcacgtca gcgagagtgt gagtacgact cctgaccaca ccccgaaatc 2700 
tcccccactt tccacttttc cgggctcagc ccattttcat ttccatcact cattcggcct 2760 
tagatactct ggcgaaggag ttgtcactga actcgatgcg attggagtgt tgagtaaact 2820 
aacttaacta aataataacc aagcatagct tacttttaga agcttttatg aatatttatt 2880 
aaattctctt gtttcccatt taaatcctgc catggatttc tccattatca tcttgtgctt 2940 
tcctgctatt aagaatatgc tgtgtgtgta ttttttcacc attcgtgtaa tttccatttg 3000 
acgctaaaca tttagcagcc aggcggccac ataaaaaatg atgtgccatt cacaaattgc 3060 
acttgcgata aaattttaat tacacaacaa agtgtgtaac cgcaccgccc accccagatc 3120 
cacatccaca tccacatcca gctccagatt ccgtgccaca acccattcca ctgactaccg 3180 
actccttctc cttctcttta cacagagtgc ggctgtggcg gacgtgctgt acgacatgga 3240 
gtggtacaaa tgcgatgcga ggactaggaa agtgatttta atgatattgc gccgttcgca 3300 
gcgggcaaaa acaattgcgg tgccgttttt tacgccctca ctgccagcac tccgatctgt 3360 
atgggatgcc atgcgatgcg ttgattatga taataaatgt ttatgtttct gtttccgttt 3420 
ccacttcgat catctgcccg gcactccaga tactcagcac agccggctca tatatcacgc 3480 
tgctaaagac gttcctgtaa 3500 



<210> 101 
<211> 1312 
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<212> DNA 

<213> Drosophila melanogaster 



<220> 

<223> DOR 23A.1, NCBI Accession No. AF127925 
<220> 

<221> intron 
<222> (1) . . (121) 

<220> 

<221> intron 
<222> (838) . . (888) 

<400> 101 

gatttaattt gcgatttaat aacccctgac taatggcgca actgacacag aaaggtgata 60 
taaatggtcg aggtcctttt gaaagtatgc cagtgaaaaa gaaagacacc aagaattcaa 120 
aatgaagctc agcgaaaccc taaaaatcga ctattttcga gtccagttga atgcctggcg 180 
aatttgtggt gccttggatc tcagcgaggg taggtactgg agttggtcga tgctattgtg 240 
catcttggtg tacctgccga cacccatgct actgagagga gtatacagtt tcgaagatcc 300 
ggtggaaaat aatttcagct tgagcctgac ggtcacttcg ctgtccaatc tcatgaagtt 360 
ctgcatgtac gtggcccaac taacaaagat ggtcgaggtc cagagtctta ttggtcagct 420 
ggatgcccgg gtttctggcg agagccagtc tgagcgtcat agaaatatga ccgagcacct 480 
gctaaggatg tccaagctgt tccagatcac ctacgctgta gtcttcatca ttgctgcagt 54 0 
tcccttcgtt ttcgaaactg agctaagctt acccatgccc atgtggtttc ccttcgactg 600 
gaagaactcg atggtggcct acatcggagc tctggttttc caggagattg gctatgtctt 660 
tcaaattatg caatgctttg cagctgactc gtttcccccg ctcgtactgt acctgatctc 720 
cgagcaatgt caattgctga tcctgagaat ctctgaaatc ggatatggtt acaagactct 780 
ggaggagaac gaacaggatc tggtcaactg catcagggat caaaacgcgc tgtataggta 84 0 
atcctactat ctctatgttt ccagttctaa tactcaatct tcctttagat tactcgatgt 900 
gaccaagagt ctcgtttcgt atcccatgat ggtgcagttt atggttattg gcatcaacat 960 
cgccatcacc ctatttgtcc tgatatttta cgtggagacc ttgtacgatc gcatctatta 1020 
tctttgcttt ctcttgggca tcaccgt-gca gacatatcca ttgtgctact atggaaccat 1080 
ggtgcaggag agttttgctg agcttcacta tgcggtattc tgcagcaact gggtggatca 1140 
aagtgccagc tatcgtgggc acatgctcat cctggcggag cgcactaagc ggatgcagct 1200 
tctcctcgcc ggcaacctgg tgcccatcca cctgagcacc tacgtggcct gttggaaggg 1260 
agcctactcc ttcttcaccc tgatggccga tcgagatggc ctgggttctt ag 1312 



<210> 102 
<211> 1532 
<212> DNA 

<213> Drosophila melanogaster 
<220> 

<223> DOR 24D1., segment of DBGP Clone No. AC004371 
<220> 
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<221> intron 
<222> (1) . . (87) 

<220> 

<221> intron 
<222> (403) . . (456) 

<220> 

<221> intron 
<222> (609) . . (689) 



<220> 

<221> intron 

<222> (1063) . . (1114) 



<220> 

<221> intron 

<222> (1214) . . (1268) 



<220> 

<221> intron 

<222> (1422) . . (1481) 



<400> 102 

aatatataca tgcagttcca aaggctataa aggacgaggg tcaaaggatt atgggctgaa 60 
aagtgtttgg tgtcctcgga gagaaagatg ttacctcgat tcctgaccgc ctcctatcca 120 
atggagcgcc attatttcat ggtgccaaag tttgcattat cgctgattgg tttttatccc 180 
gaacagaagc gaacggtttt ggtgaaactt tggagtttct tcaacttttt catcctcacc 240 
tacggctgtt atgcagaggc ttactatggc atacactata taccgattaa catagccact 300 
gcattggatg ccctttgtcc tgtggcctcc agcattttgt cgctggtgaa aatggtcgcc 360 
atttggtggt atcaagatga attaaggagt ttgatagagc gggtaagatt tttaacagag 420 
caacagaagt ccaagaggaa actgggctat aagaagaggt tctatacact ggcaacgcaa 480 
ctaacattcc tgctactatg ctgtggattt tgcaccagta cttcctattc cgtcagacat 540 
ttgattgata atatcctgag acgcacccat ggcaaggact ggatctacga gactccgttc 600 
aagatgatgt aaggaaaggg aagaatggtt tatatatact tttggaacga aataatgatg 660 
tgatctaaac aagatgcact tttttttagg ttccccgatc ttctcctgcg tttgccactc 720 
tatcccatca cctatatact cgtgcattgg catggctaca ttactgtggt ttgttttgtc 780 
ggcgcggatg gtttcttcct ggggttctgt ttgtacttca ctgttttgct gctctgtctg 840 
caggacgatg tttgtgattt actagaggtt gaaaacatcg agaagagtcc ctccgaagcg 900 
gaggaagctc gcatagttcg ggaaatggaa aaactggtgg accggcataa cgaggtggcc 960 
gagctgacag aaagattgtc gggtgttatg gtggaaataa cactggccca ctttgttact 1020 
tcgagtttga taatcggaac cagcgtggtg gatattttat tagtgggtat ttacatttga 1080 
ttagatcctt tcgatatatg ttcttaaatt ctagttttcc ggcctgggaa tcattgtgta 1140 
tgtggtctac acttgtgccg taggtgtgga aatatttcta tactgtttag gaggatctca 1200 
tattatggaa gcggtatatt cataagaaac tactataaag ttacttttaa attcattgca 1260 
tttcttagtg ttccaatcta gcgcgctcca cattttccag ccactggtat ggccacagtg 1320 
ttcgggtcca aaagatgacc cttttgatgg tagctcgtgc tcaacgagtt ctcacaatta 1380 
aaattccttt cttttcccca tcattagaga ctctaacttc ggtaagctta tgcgaaaatg 1440 
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ttatggtaca cacaagtcta catttctatg aggtcttgta gattttgcgc ttcactggat 1500 
ctctgattgc cctggcaaag tcggttatat aa 1532 



<210> 103 
<211> 1691 
<212> DNA 

<213> Drosophila melanogaster 
<220> 

<223> DOR 25A.1, segment of BDGP Clone No. AC005463 
<220> 

<221> intron 
<222> (1) . . (121) 

<220> 

<221> intron 

<222> (898) . . (1192) 

<220> 

<221> intron 

<222> (1551) . . (1613) 

<400> 103 

aagaggaacg cctactaaat tggcaaacat taattaagaa ctacaaatcg cggactataa 60 
aagcattccg agctcctcct atcgccagtt atattatagc cttaaacatt cgaaacttta 120 
gatgttcgga cactttaagc tcgtctatcc ggctcctata tcggagccca tacagtctag 180 
ggattcgaat gcatacatga tggagacgct gcgaaattcg ggcttgaatt tgaagaacga 240 
tttcggtata ggccgcaaga tttggagggt gttttcgttc acctacaata tggtgatact 300 
tcccgtaagt ttcccaatca actatgtgat acatctggcg gagttcccgc cggagctgct 360 
gctgcaatcc ctgcaactgt gcctcaacac ttggtgcttc gctctgaagt tcttcactct 420 
gatcgtctat acgcaccgct tggagctggc caacaagcac tttgacgaat tggataagta 4 80 
ctgcgtgaag ccggcggaga agcgcaaggt tcgcgacatg gtggccacta ttacaagact 540 
gtacctgacc ttcgtcgtgg tctacgtcct ctacgccacc tccacgctac tggacggact 600 
actgcaccac cgtgttccct acaatacgta ctatccgttc ataaactggc gagtcgatcg 660 
gacccagatg tacatccaga gttttctgga gtacttcacc gtgggttatg ccatatatgt 720 
ggccaccgcc accgattcct accctgtgat ttacgtggca gccctgcgaa ctcatattct 780 
cttgctcaag gaccgtatca tttacttggg cgatcccagc aacgagggta gcagcgaccc 840 
gagctacatg tttaaatcgt tggtggattg tatcaaggca cacagaacca tgctaaagta 900 
agtcagtttc attatcctac cggaaatatc aacaaccgtg aattatgttt tcactcaacg 960 
gatctgaatt tattcggaaa actgaataaa agtacaactt ttataagaac tcgctattga 1020 
tatttctgaa tatataagag tatattttta cattcaggat aaaataaatg gtatatcaaa 1080 
cgattcgaca taaaactatt ttatttcaaa taaattagct tacatttttt aaaatatatg 1140 
cattaattta aattttcaat gttaaaaacc catcccactg tttcccatgc agtttttgtg 1200 
atgccattca accaatcatc tctggcacga tatttgccca attcatcata tgcggatcga 1260 
tcctgggcat aattatgatc aacatggtat tgttcgctga tcaatcgacc cgattcggca 1320 
tagtcatcta cgttatggcc gtccttctgc agacttttcc gctttgcttc tactgcaacg 1380 
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ccatcgtgga cgactgcaaa gaactggccc 
aggacaagcg ataccagcgg actgtcatcc 
ccttcaccgc catgaacata tttaacatta 
tgctctcggt tagaacgggg aaggaagcaa 
agttcgcctt caccgtgtac gccatcgcga 
ttaaggaata g 



acgcactttt ccattccgcc tggtgggtgc 1440 
agttcctgca gaaactgcag cagcccatga 1500 
atttggccac taacatcaat gtaagtccac 1560 
agtccgaact tcaatccttg caggtagcca 1620 
gcggtatgaa cctggaccaa aagttaagca 1680 
1691 



<210> 104 
<211> 1333 
<212> DNA 

<213> Drosophila melanogaster 
<220> 

<223> DOR 33B.1, region of BDGP Clone No. AC006240 
<220> 

<221> intron 
<222> (1) . . (121) 

<220> 

<221> intron 
<222> (844) .. (918) 

<400> 104 

ctggaatcaa ttaaatataa ttgcggggac aacaattacg tctacccgac ttaaatttaa 60 
ttattaaaag ggaccagtat aaaaggaaag gcggtacata acttgtccaa ttgtttcaaa 120 
tatggattca agaaggaaag tccgaagtga aaatctttac aaaacctatt ggctttactg 180 
gcgacttctg ggagtcgagg gcgattatcc ttttcgacgg ctagtggatt ttacaatcac 240 
gtctttcatt acgattttat ttcccgtgca tcttatactg ggaatgtata aaaagcccca 300 
gattcaagtc ttcaggagtc tgcatttcac atcggaatgc cttttctgca gctataagtt 360 
tttctgtttt cgttggaaac ttaaagaaat aaagaccatc gaaggattgc tccaggatct 420 
cgatagtcga gttgaaagtg aagaagaacg caactacttt aatcaaaatc caagtcgtgt 480 
ggctcgaatg ctttcgaaaa gttacttggt agctgctata tcggccataa tcactgcaac 540 
tgtagctggt ttatttagta ctggtcgaaa tttaatgtat ctgggttggt ttccctacga 600 
ttttcaagca accgccgcaa tctattggat tagtttttcc tatcaggcga ttggctctag 660 
tctgttgatt ctggaaaatc tggccaacga ttcatatccg ccgattacat tttgtgtggt 720 
ctctggacat gtgagactat tgataatgcg tttaagtcga attggtcacg atgtaaaatt 780 
atcaagttcg gaaaatacca gaaaactcat cgaaggtatc caggatcaca ggaaactaat 840 
gaagtaagaa taaagattta agaaccgcat gtttgatagc tcagagaact gataattaat 900 
caaatgtaac ttttccagga taatacgcct acttcgcagc actttacatc ttagccaact 960 
gggccagttc ctttctagtg gaatcaacat ttccataaca ctcatcaaca tcctgttctt 1020 
tgcggaaaac aactttgcaa tgctttatta tgcggtgttc tttgctgcaa tgttaataga 1080 
actatttcca agttgttact atggaattct gatgacaatg gagtttgata agctaccata 1140 
tgccatcttc tccagcaact ggcttaaaat ggataaaaga tacaatcgat ccttgataat 1200 
tctgatgcaa ctaacactgg ttccagtgaa tataaaagca ggtggtattg ttggcatcga 1260 
tatgagtgca ttttttgcca cagttcggat ggcatattcc ttttacactt tagccttgtc 1320 
atttcgagta tag 1333 
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<210> 105 
<211> 1442 

<212> DNA 

<213> Drosophila melanogaster 



<220> 

<223> DOR 33B.2, region of BDGP Clone No. AC006240 



<220> 

<221> intron 
<222> (1) . . {121) 

<220> 

<221> intron 

<222> (849) . . (1030) 



<400> 105 

tatagggtta tcaaactcgc agtaaattga 
ccatttagaa cgactagtat aaaagaggaa 
catggactta aaaccgcgag tcattcgaag 
ttggcatctt ttgggcctgg aaagcaattt 
tacaattttc gtaaccattt ggtatccaat 
atctttgggg gatgtctgca agggtctacc 
taaatttatt tgttttcgct tcaagctatc 
agagctggat cagcgagctt taagtcgaga 
acgtgaggcg aatttcattt ggaaaagttt 
ggctattgca tcagttcttt tcggcggtgg 
atacgatgtg caggccacgg aactaatatt 
agtaagtttg gccatacttc agaatttggc 
cgtggttgcc ggtcatgtaa gacttttggc 
agaggaaaca atatacttaa ccggaaagca 
actaatgaag taatgtacat atatagaatg 
ttgtaggaaa accattctgt ttgtcgggtg 
atgatattaa atacttcctt gcaaacaatt 
ttatttccag aatagtggaa ttactgcgca 
ttatttcaag tggtgttaat atttccataa 
ataatttcgc tataacctac tacggagtgt 
cgtgctgcta ttacggcacc ctgatatccg 
actcaagtaa ctggatgagt atgaatcgga 
aactcaccct ggcggaagtg cagatcaagg 
ccttctttgc caccgtgcga ttggcctact 
aa 



tgggcgtaac gatgtgggac tataatttag 60 
atgccgcagg accttttcaa ttgtttcaaa 120 
tgaagatatc tacagaacct attggttata 180 
ctttctgaat cgcttgttgg atttggtgat 240 
tcacctgatt ctgggactgt ttatggaaag 300 
aattacggca gcatgctttt tcgccagctt 360 
tgaaattaaa gaaatcgaaa tattatttaa 420 
ggaatgcgag tttttcaatc aaaatacgag 480 
cattgtggcc tatggactgt cgaatatctc 54 0 
acataagcta ttatatcccg cctggtttcc 600 
ttggctaagt gtaacatacc aaattgccgg 660 
caatgattcc tatccaccga tgacattttg 720 
gatgcgcttg agtagaattg gccaaggtcc 780 
attaatcgaa agcatcgagg atcaccgaaa 840 
gtttttagtt attatcatta aatgaacgtg 900 
tcacggaaat cgattttcct taatttacat 960 
atcatattag taatttagaa tctttattat 1020 
gcaccatgaa tatttcgcag ctcggccagt 1080 
cactagtcaa cattctcttc tttgcggata 1140 
acttcctatc gatggtgttg gaattattcc 1200 
tggagatgaa ccagctgacc tatgcgattt 1260 
gctacagccg catcctactg atcttcatgc 1320 
ccggtgggat gattggcatc ggaatgaacg 1380 
ccttcttcac tttggccatg tcgctgcgtt 144 0 
1442 



<210> 106 
<211> 1354 



207 



<212> DNA 

<213> Drosophila raelanogaster 
<220> 

<223> DOR 33B.3, region of BDGP Clone No. AC006240 
<220> 

<221> intron 
<222> (1) . . (120) 

<220> 

<221> intron 
<222> (852) . . (930) 

<400> 106 

tatatatatg aatatatgag catcttcttg ttcatagtgg tataatttct cgcagtgttt 60 
cctctcatga catctgcata aaagctccag ccttgctggg aaatgtgcat taggtgaaca 120 
atggtcatta tcgacagtct tagtttttat cgtccattct ggatctgcat gcgattgctg 180 
gtaccgactt tcttcaagga ttcctcacgt cctgtccagc tgtacgtggt gttgctgcac 240 
atcctggtca ccttgtggtt tccactgcat ctgctgctgc atcttctgct acttccatct 300 
accgctgagt tctttaagaa cctgaccatg tctctgactt gtgtggcctg cagtctgaag 360 
catgtggccc acttgtatca cttgccgcag attgtggaaa tcgaatcact gatcgagcaa 420 
ttagacacat ttattgccag cgaacaggag catcgttact atcgggatca cgtacattgc 480 
catgctaggc gctttacaag atgtctctat attagctttg gcatgatcta tgcgcttttc 540 
ctgttcggcg tcttcgttca ggttattagc ggaaattggg aacttctcta tccagcctat 600 
ttcccattcg acttggagag caatcgcttt ctcggcgcag tagccttggg ctatcaggta 660 
ttcagcatgt tagttgaagg cttccagggg ctgggcaacg atacctatac cccactgacc 720 
ctatgccttc tggccggaca tgtccatttg tggtccatac gaatgggtca actgggatac 780 
ttcgatgacg agacggtggt gaatcatcag cgtttgctgg attacattga gcagcataaa 840 
ctcttggtgc ggtaagcttt gattaactaa cttttgacaa gaagtttatt cactttaact 900 
ggttccaaaa acgatgcact caatgtgcag attccacaac ctggtgagcc ggaccatcag 960 
cgaagtgcaa ctggtgcagc tgggcggatg tggagccact ctgtgcatca ttgtctccta 1020 
catgctcttc tttgtgggcg acacaatctc gctggtctac tacttggtgt tctttggagt 1080 
ggtctgcgtg cagctctttc ccagctgcta ttttgccagc gaagtagccg aggagttgga 1140 
acggctgcca tatgcgatct tctccagcag atggtacgat caatcgcggg atcatcgatt 1200 
cgatttgctc atctttacac aattaacact gggaaaccgg gggtggatca tcaaggcagg 1260 
aggtcttatc gagctgaatt tgaatgcctt tttcgccacc ctgaagatgg cctattccct 1320 
ttttgcagtt gtggtgcggg caaagggtat atag 135 4 



<210> 107 
<211> 1795 
<212> DNA 

<213> Drosophila melanogaster 
<220> 

<223> DOR 43B.1, NCBI Accession No. AF127926 



208 



<220> 

<221> intron 
<222> (1) . . (121) 

<220> 

<221> intron 
<222> (539) . . (692) 

<220> 

<221> intron 

<222> (1013) . . (1075) 

<220> 

<221> intron 

<222> (1176) . . (1319) 

<220> 

<221> intron 

<222> (1422) . . (1487) 

<220> 

<221> intron 

<222> (1596) . . (1690) 

<400> 107 

gcccgcagat tgccctcctc gatggcgaca attggtgcag aataaaagac atttgcccag 60 
cactttgcag cacagtctgg agtggatctg ccggtaagga gcttatagaa agtgccgagc 120 
aatgacaatc gaggatatcg gcctggtggg catcaacgtg cggatgtggc gacacttggc 180 
cgtgctgtac cccactccgg gctccagctg gcgcaagttc gccttcgtgc tgccggtgac 240 
tgcgatgaat ctgatgcagt tcgtctacct gctgcggatg tggggcgacc tgcccgcctt 300 
cattctgaac atgttcttct tctcggccat tttcaacgcc ctgatgcgca cgtggctggt 360 
cataatcaag cggcgccagt tcgaggagtt tctcggccaa ctggccactc tgttccattc 420 
gattctcgac tccaccgacg agtgggggcg tggcatcctg cggagggcgg aacgggaggc 4 80 
tcggaacctg gccatcctta atttgagtgc ctccttcctg gacattgtcg gtgctctggt 540 
atcgccgctt ttcagggagg agagaggtta gtacactcac agaatgggat taaaataacc 600 
gtcattcaaa tcgaaaggat atttgtagag taaaagagtg gaaaattgct aaatagtggt 660 
taaaactgca aagtagtttg tgcattgctt agtttttcga atataaattc ccaattggtg 720 
ttgtcacttt tttccttcca gctcatccct tcggcttagc tctaccagga gtgagcatga 780 
ccagttcacc cgtctacgag gttatctact tggcccaact gcctacgccc ctgctgctgt 8 40 
ccatgatgta catgcctttc gtcagccttt ttgccggcct ggccatcttt gggaaggcca 900 
tgctgcagat cctggtacac aggctgggcc agattggcgg agaagagcag tcggaggagg 9 60 
agcgcttcca aaggctggcc tcctgcattg cgtaccacac gcaggtgatg cggtgagccg 1020 
caggggaaga aatcgggggt tggggaaatc tctaagctgc tcatttccat ttcagctatg 1080 
tgtggcagct caacaaactg gtggccaaca ttgtggcggt ggaagcaatt atttttggct 1140 
cgataatctg ctcactgctc ttctgtctga atattgtagg ccactgattt tgagttcggg 1200 
tgccacagag cctccataaa ttggaagaac caaaactttg ggatggcggt cttctatggc 1260 
actgtggcac tccgagatct aagactgtgg tccttttaaa aataccatta ccattccaga 1320 
taacctcacc cacccaggtg atctcgatag tgatgtacat tctgaccatg ctgtacgttc 1380 



209 



tcttcaccta ctacaatcgg gccaatgaaa tatgcctcga ggtgatacta ccaactccat 1440 
tcaatttgat agctatccat gactccttat aacttgtatc catgcagaac aaccgggtgg 1500 
cggaggctgt ttacaatgtg ccctggtacg aggcaggaac tcggtttcgc aaaaccctcc 15 60 
tgatcttctt gatgcaaaca caacacccga tggaggtggg tgggcattgg gtgcatttca 1620 
tttgcgagtc attgcgagga ctgtaatcca tttccattcc gggtgctgtg cgctccattt 1680 
catcttgcag ataagagtcg gcaacgttta ccccatgaca ttggccatgt tccagagtct 1740 
gttgaatgcg tcctactcct actttaccat gctgcgtggc gtcaccggca aatga 1795 



<210> 108 
<211> 1426 
<212> DNA 

<213> Drosophila melanogaster 
<220> 

<223> DOR 46F.1, region of BDGP Clone No. AC005974 
<220> 

<221> intron 
<222> (1) . . (120) 

<220> 

<221> intron 

<222> (949) . . (1040) 

<220> 

<221> intron 

<222> (1130) . . (1185) 

<400> 108 

ataatttaat gattaaccaa cgattaggtc gtctaatgac attttaaatg ccgttcagaa 60 
gccttataaa cacttgaacg aatcaattaa tcttaagttc tggaaaaatc tggacgtgaa 120 
atgagcaaag gagtagaaat cttttacaag ggccagaagg cattcttgaa catcctctcg 180 
ttgtggcctc agatagaacg ccggtggaga atcatccacc aggtgaacta tgtccacgta 240 
attgtgtttt gggtgctgct ctttgatctc ctcttggtgc tccatgtgat ggctaatttg 300 
agctacatgt ccgaggttgt gaaagccatc tttatcctgg ccaccagtgc agggcacacc 3 60 
accaagctgc tgtccataaa ggcgaacaat gtgcagatgg aggagctctt taggagattg 420 
gataacgaag agttccgtcc tagaggcgcc aacgaagagt tgatctttgc agcagcctgt 4 80 
gaaagaagta ggaagcttcg ggacttctat ggagcgcttt cgtttgccgc cttgagcatg 540 
attctcatac cccagttcgc cttggactgg tcccaccttc cgctcaaaac atacaatccg 600 
cttggcgaga ataccggctc acctgcttat tggctcctct actgctatca gtgtctggcc 660 
ttgtccgtat cctgcatcac caacatagga ttcgactcac tctgctcctc actgttcatc 720 
ttcctcaagt gccagctgga cattctggcc gtgcgactgg acaagatcgg tcggttaatc 780 
actacttctg gtggcactgt ggaacagcaa cttaaggaaa atatccgcta tcacatgacc 840 
atcgttgaac tgtcgaaaac cgtggagcgt ctactttgca agccgatttc ggtgcagatc 900 
ttctgctcgg ttttggtgct gactgccaat ttctatgcca ttgctgtggt gagctgtgaa 960 
ttcgcaacaa gaagactatc ataaatgctt tctagaaaaa caatggtctt aatttgtttt 1020 
ccctatatac ataattgtag ttatctgacg agaggctgga gctctttaag tatgtgacct 1080 



210 



atcaggcgtg catgttgatt cagattttta 
atatccgaaa tgcctcccca ggccagtaat 
cagcctggac cttccgcacg agctgtacaa 
ccgaaggatt gcgctcctct ttatgcaacg 
taatccaagt cttggttttg acttaatgct 
gacttttttg tgcactgtag ccaatttcca 



tattgtgcta ctatgccggg tgggtattcc 1140 
gaatttcatc gtcagtgagg taacccagcg 1200 
gacctcctgg gtggactggg actacaggag 1260 
ccttcactcg accttgagga ttaggacact 1320 
cttcagctcg gtgagttctt tccgtgtttt 1380 
taatgaggct cattag 1426 



<210> 109 
<211> 1337 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<223> DOR 46F.2, region of BDGP Clone No. AC005974 



<220> 

<221> intron 
<222> (1) . . (121) 



<220> 

<221> intron 

<222> (1223) . . (1283) 



<400> 109 

gcactgtagc caatttccat aatgaggctc 
aggctttcct cgtgcggttc atgatggaaa 
gatggttacg gaggactttt ataagtacca 
gcagctcccc acttgggccg cagaccacca 
catcctggtc atcctgttca tcatgctgct 
ttcccaagtt agggagatcc taaaggtatt 
ggccaaatta ttgcatttga agttgaagag 
gttgtcccca gagttcggcg ttaaaagtga 
agtggcggtt gtccgcatga ggaactccta 
gatccttata gttccctgtt tcgacaactt 
atgcagcatc gagggatgga tctgctattg 
gctgcccact tgtgtgctga atataaccta 
cttgaaggtt cagctacaaa tgctggtcct 
accccaggat aatgagaaaa tcgcaatgga 
gattgttcgt ttcaaggacc tggtggagct 
catgtgttct gttctggtgc tggtgtccaa 
aaacggcgat gccatcttta tgctcaagac 
gatcttcatc atttgctacg cctccaacga 
cagcatctac agctcccaat ggacgggatg 
catgatgcag cgctttaatt ccccgatgct 
cagcttggag gcctttggtt ctgtagggca 
tggttatgct cttctccttt cagatcgtca 
agcgcgtcaa cagttaa 



attagcgaat ttaagttcgt ggcctataaa 60 
agtgaaaatt gtcagcatgt ctcagcatca 120 
ggtgtggtac ttccaaatcc ttggtgtttg 180 
gcgtcgtttt cagtccatga ggtttggctt 240 
gcttttctcc ttcgaaatgt tgaacaacat 300 
cttcatgttc gccacggaaa tatcctgcat 360 
ccgcaaactc gctggcttgg ttgatgcgat 420 
acaggaaatg cagatgctgg aattggatag 480 
cggcatcatg tccctgggcg cggcttccct 540 
tggcgagcta ccactggcca tgttggaggt 600 
gtcgcagtac cttttccact cgatttgcct 660 
cgactcggtg gcctactcgt tgctctgttt 720 
gcgattagaa aagttgggtc ctgtgatcga 7 80 
actgcgtgag tgtgccgcct actacaacag 8 40 
gttcataaag gggccaggat ctgtgcagct 900 
cctgtacgac atgtccacca tgtccattgc 960 
ctgtatctat cagctggtga tgctctggca 1020 
ggtaactgtc cagagctcta ggttgtgtca 1080 
gaacagggca aaccgccgga ttgtccttct 1140 
cctgagcacc tttaacccca cctttgcttt 1200 
gcagaaattc ctttatatat catttattac 1260 
actgctccta cagctacttc gcactgctga 1320 
1337 
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<210> 110 
<211> 1466 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<223> DOR 47E.1, NCBI Accession No. AF156880 



<220> 

<221> intron 
<222> (1) . . (121) 



<220> 

<221> intron 
<222> (423) . . (478) 



<220> 

<221> intron 
<222> (669) . . (728) 

<220> 

<221> intron 

<222> (1345) . . (1415) 



<400> 110 

gggtattaaa gagctgtact ttcgcatctg tatttcattt attcctactc aattgaaagg 60 
tttaaaaccc ctgaactgaa cacacttgac ttagtgtgag gccgaattaa cccttgtcga 120 
catggacagt tttctgcaag tacagaagag caccattgcc cttctgggct ttgatctctt 180 
tagtgaaaat cgagaaatgt ggaaacgccc ctatagagca atgaatgtgt ttagcatagc 240 
tgccattttt ccctttatcc tggcagctgt gctccataat tggaagaatg tattgctgct 300 
ggccgatgcc atggtggccc tactaataac cattctgggc ctattcaagt ttagcatgat 360 
actttactta cgtcgcgatt tcaagcgact gattgacaaa tttcgtttgc tcatgtcgaa 420 
tggtgagttg taatccattt cggccagaat gtgtatcatt tcatttatta ttttatagag 4 80 
gcggaacagg gcgaggaata cgccgagatt ctcaacgcag caaacaagca ggatcaacga 540 
atgtgcactc tgtttaggac ttgtttcctc ctcgcctggg ccttgaatag tgttctgccc 600 
ctcgtgagaa tgggtctcag ctattggtta gcaggtcatg cagagcccga gttgcctttt 660 
ccctgtctgt atgtacaaat gatatatatg atatatggtg atcaagttat caggctttgt 720 
tcctaaagtt ttccctggaa tatccacatc attcgcaatt atgttttgag cttcatctgg 780 
agcgctttcg cctcgacagg tgtggtttta cctgctgtca gcttggatac catattctgt 840 
tccttcacca gcaacctgtg cgccttcttc aaaattgcgc agtacaaggt ggttagattt 900 
aagggcggat cccttaaaga atcacaggcc acattgaaca aagtctttgc cctgtaccag 960 
accagcttgg atatgtgcaa cgatctgaat cagtgctacc aaccgattat ctgcgcccag 1020 
ttcttcattt catctctgca actctgcatg ctgggatatc tgttctccat tacttttgcc 1080 
cagacagagg gcgtctacta tgcctcattc atagccacaa tcattataca agcctatatc 1140 
tactgctact gcggggagaa cctgaagacg gagagtgcca gcttcgagtg ggccatctac 1200 
gacagtccgt ggcacgagag tttgggtgct ggtggagcct ctacctcgat ctgccgatcc 1260 



212 



mm 



ttgctgatca gcatgatgcg ggctcatcgg 
gcaaacatgg aggccttctc atcggtgggt 
tacatgaata ctcttttcta actttccgtt 
tcacaatgct gagatcattc tcctaa 



ggattccgca ttacgggata ctttttcgag 1320 
gaatcatttc cattgtacaa tacatggatt 1380 
tttagattgt tcgcacggcg atgtcctaca 1440 
1466 



<210> 111 
<211> 1724 
<212> DNA 

<213> Drosophila melanogaster 
<220> 

<223> DOR 47E.2, region of BDGP Clone No. AC005638 
<220> 

<221> intron 
<222> (1) . . (121) 

<220> 

<221> intron 
<222> (256) . . (357) 

<220> 

<221> intron 
<222> (491) . . (552) 

<220> 

<221> intron 

<222> (1071) . . (1119) 

<220> 

<221> intron 

<222> (1328) . . (1456) 

<220> 

<221> intron 

<222> (1616) . . (1673) 

<400> 111 

ctcggtgggc gtttttttgc aaaacaattg cttataaaca tggtaattag accctggagg 60 
cctccttgta catataaaag gaactgaccg cccgcctgga gctcagttca tactttgcga 120 
aatgaacgac tcgggttatc aatcaaatct cagccttctg cgggtttttc tcgacgagtt 180 
ccgatcggtt ctgcggcagg aaagtcccgg tctcatccca cgcctggctt tttactatgt 240 
tcgcgccttt ctgaggtgaa tctggcatcg tccatcactc gatccaatct tttataatcc 300 
acatttttgc catacccaca gtttgctgtg ccagtatccc aacaagaagt tggccagctt 360 
gcccctgtac cgatggatca acttgttcat catgtgcaat gtgatgacca ttttctggac 420 
catgttcgtg gccctgcccg agtcgaagaa cgtgatcgaa atgggcgacg acttggtttg 480 
gatttcgggg gtgagattta ctgattcacc cactcgcaaa tgaacaaact aatggttatt 540 



213 



tctgggtgat agatggcact ggtgttcacc 
gagatcgatg aacttatttc ggattttgaa 
atcgatgagg aggtgttggg ttggcagaga 
atcaactgct tttgcctggt caacttcttc 
ggcgagggaa agctgccctt ccacagcgtc 
catccctaca cgttctggtt cctctacatc 
atgagcattc taatggtgga tatggtaggc 
ctcaagttgc tttgcatcga gataaggaaa 
ttccacgagg agttttgtcg tgtggttcgc 
tgtgccgagt tacaagtact acacgcgttt 
agctttcaat ggcgccttca atgcacaatt 
cactttcgag accatggctg cagcggctgt 
tctcatgctg gtggcattca ttcaactgtc 
tactcaggta tgtcgtttaa aatctttgaa 
tagctgcacc tagatattga aaccttattg 
caactttcat caatagtcag tggaggtggc 
caccaaatcg ccaggcatcc agagggatat 
cctgatgtat gtggccgaac catttctgcc 
gatataaaaa tgtatataat aaagatatca 
agaactgcta tcgtttgctg gccctgatgc 



aagatctttt acatgcattt gcgttgcgac 600 
tactacaacc gggagctgag accccataat 660 
ctgtgctacg tgatagaatc gggtctatat 720 
agtgccgcta ttttcctgca acctctgttg 780 
tatccgtttc aatggcatcg cttggatctg 840 
tggcagagtc tgacctcgca gcacaaccta 900 
atttccacgt tcctccagac ggcgctcaat 960 
ctgggggaca tggaggtcag tgataagagg 1020 
ttccaccagc acattatcaa gtgagttttg 1080 
ttttggcaga ttggtgggga aagccaatag 1140 
aatggccagt ttctccctga tttccatatc 1200 
ggatcccaaa atggccgcca agttcgtgct 1260 
gctttggtgc gtctctggaa ctttggttta 1320 
aactttttaa caatcactcc ttttcctaat 1380 
aaatttactt attttgagat ttttcctgtt 1440 
tcaggctgct tttgatatca acgattggca 1500 
atcctttgtg atactacgag cccagaaacc 1560 
cttcaccctg ggaacctata tgcttgtaag 1620 
cgatatcaac cgtccatttt caggttctga 1680 
aagaatcgat gtag 1724 



<210> 112 
<211> 1318 
<212> DNA 

<213> Drosophila melanogaster 
<220> 

<223> DOR 59D.1, region of BDGP Clone No. AC005672 
<220> 

<221> intron 
<222> (1) . . (120) 

<220> 

<221> intron 
<222> (846) . . (903) 

<400> 112 

taattgaggt ggccacgcaa ggctaattac ataattgggc aatcgcaccg attgagccat 60 
cacccaaaac ggtatttaag tgtggagaac cttacattaa agccgagaat tcaaggaacc 120 
atggcagagg tcagagtgga cagtctggag tttttcaaga gccattggac cgcctggcgg 180 
tacttgggag tggctcattt tcgggtcgag aactggaaga acctttacgt gttttacagc 240 
attgtgtcga atcttctcgt gaccctgtgc taccccgttc acctgggaat atccctcttt 300 
cgcaaccgca ccatcaccga ggacatcctc aacctgacca cctttgcgac ctgcacagcc 360 
tgttcggtga agtgcctgct ctacgcctac aacatcaagg atgtgctgga gatggagcgg 420 
ctgttgaggc ttttggatga acgcgtcgtg ggtccggagc aacgcagcat ctacggacaa 480 
gtgagggtcc agctgcgaaa tgtgctatac gtgttcatcg gcatctacat gccgtgtgcc 540 
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ctgttcgccg agctatcctt tctgttcaag 
tttcccttcg actggctgca ctccaccagg 
gtgggcatct cgtttcagct gctgcaaaac 
ctgtgcctga tctcatccca catcaaaatg 
gatccagcca gagatgcgga gaaggacctg 
ctagagtggg caggcggctc attgtaacgt 
cagactattc cgacgcatcg aggccttcat 
gaccgccttg aatgtgtgca tcggtttagc 
ggcacggatg tacttcatct tctactccct 
ctttttcggc accgacaacg agtactggtt 
caattggcac acacagaaca ggagctttaa 
gttgaagaag agcaccgctg tggctggcgg 
ttccacccta aagggggcct actccctctt 



gaggagcgcg gtctgatgta tcccgcctgg 600 
aactattaca tagcgaacgc ctatcagata 660 
tatgttagcg actgctttcc ggcggtggtg 720 
ttgtacaaca gattcgagga ggtgggcctg 780 
gaggcctgca tcaccgatca caagcatatt 840 
tcgtgttcta ttcactttcc aacttttttc 900 
ttccctgccc atgctaattc agttcacagt 960 
agccctggtg tttttcgtca gcgagcccat 1020 
ggccatgccg ctgcagatct ttccgtcctg 1080 
cggacgcctc cactacgcgg ccttcagttg 114 0 
gcggaaaatg atgctgttcg ttgagcaatc 1200 
aatgatgcgt atccacctgg acacgttctt 1260 
taccatcatt attcggatga gaaagtag 1318 
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